Experiencing Alerting failure for Availability Data Type – 07/01 – Resolved


Final Update: Friday, 01 July 2016 21:30 UTC

We've confirmed that all systems are back to normal with no customer impact as of 07/01, 9:30 PM UTC. Our logs show the incident started on 07/01, 7:37 PM UTC and that during the ~2 hours that it took to resolve the issue some customers experienced Alerting failure for tests running in the TX-San Antonio location.

  • Root Cause: Preliminary root cause analysis indicates an issue with underlying infrastructure running these tests.
  • Lessons Learned: We are investigating additional improvements to internal telemetry to speed diagnosis and tooling to aid in implementation of mitigation steps.
  • Incident Timeline: 1 hours 53 minutes - 07/01, 7:37 PM UTC through 07/01, 9:30 PM UTC

We understand that customers rely on Application Insights as a critical service and apologize for any impact this incident caused.

-Sapna


Skip to main content