Experiencing Data Latency for Many Data Types – 5/22 – Resolved


Final Update: Friday, 5/22/2015 05:11 UTC

We’ve confirmed that all systems are back to normal, with no customer impact as of 05/22/15 04:30 UTC. The root cause of the issue was an optimization job; once the job was stopped, the pipeline recovered.

All telemetry data types are now within the 2-hour SLA.

Chance of Reoccurrence: Low
Lessons Learned: Assess the impact of this optimization job before attempting it again.

Incident Timeline: 05/22/15 00:59 UTC through 05/22/15 04:30 UTC

We understand that customers rely on Application Insights as a critical service and apologize for any impact this incident caused.

-Application Insights Service Delivery Team



Initial Update: Friday, 5/22/2015 01:09 UTC

Application Insights recovered from the earlier data latency at 05/21/15 21:10 UTC but is experiencing data latency again due to a new issue. After recovering from the previous incident, we enabled an optimization job that put too much load on part of our service and impacted data indexing. We've stopped the optimization job and are working to restore indexing health.

Workaround: None
Next Update: Before 5/22/15 06:00 UTC

We are working hard to resolve this issue and apologize for any inconvenience.

-Application Insights Service Delivery Team

 
 
 
