Experiencing Alerting failure – 01/13 – Resolved

Final Update: Wednesday, 13 January 2016 22:22 UTC

We've confirmed that all systems are back to normal with no customer impact as of 1/13, 22:00 UTC. Our logs show the incident started on 1/13, 04:00 UTC, and that during the impact window customers would have experienced failures when creating alerts for webtests and metrics.
  • Root Cause: The failure was caused by a recent change to the system.
  • Lessons Learned: We are collecting additional telemetry to improve the monitoring options.
  • Incident Timeline: 18 hours - 1/13, 04:00 UTC through 1/13, 22:00 UTC

We understand that customers rely on Application Insights as a critical service and apologize for any impact this incident caused.

-Application Insights Service Delivery Team

Update: Wednesday, 13 January 2016 20:49 UTC

The root cause has been isolated to a recent change that was introduced in the system. We are preparing a fix, which is estimated to take 3 hours to deploy. Customers may still experience issues when creating alerts until the fix is rolled out completely.

  • Work Around: None
  • Next Update: Before 01/14 00:00 UTC

-Application Insights Service Delivery Team

Initial Update: Wednesday, 13 January 2016 19:31 UTC

We are aware of issues within Application Insights and are actively investigating. Customers may be unable to create alerts for webtests.
  • Work Around: None
  • Next Update: Before 01/13 23:00 UTC

We are working hard to resolve this issue and apologize for any inconvenience.
-Application Insights Service Delivery Team
