Experiencing Alerting failure for Many Data Types – 05/07 – Resolved


Final Update: Saturday, 07 May 2016 00:13 UTC

Starting at approximately 5/6 18:30 UTC, Application Insights customers might have experienced alerting failures for their new alerts configured between 5/6 18:30 UTC and 5/7 00:00 UTC. We’ve confirmed that all systems are back to normal with no customer impact as of 5/7, 00:00 UTC. Customers will however loose all new alerts configured during the course of this incident. 
  • Root Cause: The failure was root caused to a faulty deployment in our configuration management service.
  • Lessons Learned: We acknowledge the gap in monitoring here and is in the process of adding more telemetry to the pipeline to proactively detect and resolve such issues.
  • Incident Timeline: 5 Hours & 30 minutes – 5/6, 18:30 UTC through 5/7, 00:00 UTC

We understand that customers rely on Application Insights as a critical service and apologize for any impact this incident caused.

-Arun


Skip to main content