Experiencing Alerting failure for Availability Data Type – 07/02 – Resolved

Final Update: Saturday, 02 July 2016 04:30 UTC

We've confirmed that all systems are back to normal, with no customer impact as of 07/01, 4:25 AM UTC. Our logs show that the incident started on 07/01, 3:00 AM UTC, and that during the approximately one and a half hours it took to resolve the issue, some customers experienced alerting failures for tests running in the Miami region.
  • Root Cause: The failure was due to an issue with the underlying infrastructure running these tests.
  • Lessons Learned: We are investigating additional improvements to internal telemetry to speed diagnosis and tooling to aid in implementation of mitigation steps.
  • Incident Timeline: Approximately 1 hour 30 minutes - 07/01, 3:00 AM UTC through 07/01, 4:25 AM UTC

We understand that customers rely on Application Insights as a critical service and apologize for any impact this incident caused.


Initial Update: Saturday, 02 July 2016 04:00 UTC

We are aware of issues within Application Insights and are actively investigating. Some customers may experience alerting failures for tests running in the Miami location.

The following data types are affected: Availability.
  • Workaround: None
  • Next Update: Before 07/02 06:00 UTC

We are working hard to resolve this issue and apologize for any inconvenience.

