Experiencing Alerting Failures in Miami region – 03/30 – Resolved


Final Update: Wednesday, 30 March 2016 06:22 UTC

We’ve confirmed that all systems are back to normal with no customer impact as of 03/30, 06:25 UTC. Our logs show the incident started on 03/30, 04:40 UTC and that during the 1 hour 45 Minutes that it took to resolve the issue, customers whose web tests running in Miami region would have experienced web test samples failures.
  • Root Cause: The failure was due to a network connectivity issue which caused back end nodes to go offline  
  • Lessons Learned: We have collected telemetry logs and identified steps to be taken to avoid these kind of scenarios in future.
  • Incident Timeline: 1 Hour & 45 minutes – 03/30, 06:25 UTC through 03/30, 04:40 UTC

We understand that customers rely on Application Insights as a critical service and apologize for any impact this incident caused.

-Girish K


Initial Update: Wednesday, 30 March 2016 05:27 UTC

We are aware of issues within Application Insights and are actively investigating. We are unable to gather any web test samples running under Miami region.
  • Work Around: None
  • Next Update: Before 03/30 09:30 UTC

We are working hard to resolve this issue and apologize for any inconvenience.
-Girish K


Skip to main content