Experiencing Data Latency for Multiple Data Type – 9/18 – Resolved


Final Update: Friday, 9/18/2015 05:31 UTC

We have confirmed all systems are back to normal with no customer impact as of 9/18, 04:25 UTC. Our logs show the incident started on 9/18, 00:12 UTC.  During the 4 hours that it took to resolve the issue customers uploading data through the South Central US collection endpoint may have experienced up to 2% of their Metrics, Dependency, and/or Trace data latent by more than 2 hours.  Other data types in South Central US and all data types from other regions were unaffected.

Root Cause: The failure was due to a partial failure of the collection services in South Central US.  This failure was eventually mitigated by an automated reimage of the failing components. 
Lessons Learned: We are continually improving our monitoring and diagnostic capabilities, and expect to be able to detect and resolve this failure mode in the future before any customer impact is incurred.
Incident Timeline: 4 Hours & 13 minutes - 9/18, 00:12 UTC through 9/18, 04:25 UTC

We understand that customers rely on Application Insights as a critical service and apologize for any impact this incident caused.

-Application Insights Service Delivery Team


Initial Update: Friday, 9/18/2015 02:27 UTC

We are aware of issues within Application Insights and are actively investigating. Some customers may experience Data Latency. The following data types are affected: Metric , Dependency , Trace
Work Around: none
Next Update: Before 04:30 UTC

We are working hard to resolve this issue and apologize for any inconvenience.

-Application Insights Service Delivery Team


 

 
 

Skip to main content