Experiencing Latency for Multiple Functional Areas – 03/02 – Resolved

Final Update: Thursday, 03 March 2016 01:17 UTCWe've confirmed that all systems are back to normal with no customer impact as of 3/3, 00:00 UTC. Our logs show the incident started on 3/2, 21:30 UTC and that during the 2.5 hours that it took to resolve the issue, a subset of our customers would have experienced latency outside of SLA for all the data types.

  • Root Cause: The failure was due to a faulty piece of code inadvertently making its way to the processing service.
  • Lessons Learned: We will now focus our efforts to re work our test systems to catch similar bugs in stage environments prior to production.
  • Incident Timeline: 2 Hours & 30 minutes - 3/2, 21:30 UTC through 3/3, 00:00 UTC

We understand that customers rely on Application Insights as a critical service and apologize for any impact this incident caused.

-Arun Jolly

Update: Thursday, 03 March 2016 00:12 UTCRoot cause has been isolated to a faulty deployment which was affecting our processing service. To address this issue we rolled back the faulty deployment. Our processing components are now working as expected. A very small subset of our customers may experience a minor gap in their data ingested between 3/2 20:00 UTC and 3/2 22:00 UTC.

  • Next Update: Before 03/03 02:30 UTC

-Arun Jolly

Initial Update: Wednesday, 02 March 2016 21:33 UTCWe are aware of issues within Application Insights and are actively investigating. Some customers may experience latency outside of SLA for data ingested into App Insights. The following data types are affected: Availability,Customer Event,Dependency,Exception,Metric,Page Load,Page View,Performance Counter,Request,Trace.

  • Next Update: Before 03/03 00:00 UTC

We are working hard to resolve this issue and apologize for any inconvenience.
-Arun Jolly

Skip to main content