Experiencing Data Loss for Availability Data Type - 5/21 - Resolved


Final Update: Thursday, 5/21/2015 17:45 UTC

We’ve confirmed that all systems are back to normal with no customer impact as of 5/21, 17:20 UTC. Our logs show the incident started on 5/21, 16:17 UTC. During the 1 hour and 3 minutes it took to resolve the issue, up to 33% of customers experienced data loss for web test availability data.

Root Cause: The failure was due to a group of servers experiencing authentication issues.
Chance of Recurrence: Medium
Lessons Learned: The monitoring for this issue worked well, and we are actively investigating the captured telemetry for a full root cause analysis (RCA).
Incident Timeline: 1 hour and 3 minutes - 5/21, 16:17 UTC through 5/21, 17:20 UTC

We understand that customers rely on Application Insights as a critical service and apologize for any impact this incident caused.

-Application Insights Service Delivery Team


Initial Update: Thursday, 5/21/2015 16:42 UTC

We are aware of issues within Application Insights and are actively investigating. Some customers may experience data loss. The following data type is affected: Availability.

Workaround: None
Next Update: Before 18:00 UTC

We are working hard to resolve this issue and apologize for any inconvenience.

-Application Insights Service Delivery Team

 
