Final Update: Saturday, 05 March 2016 00:45 UTC
We’ve confirmed that all systems are back to normal with no customer impact as of 3/4, 18:30 UTC. Our logs show the incident started on 3/4, 16:00 UTC and that during the 2.5 hours that it took to resolve the issue, a subset of our customers would have seen latency for their data due to retries while sending data to the Application Insights endpoint.
Incident Timeline: 2 Hours & 30 minutes – 3/4, 16:00 UTC through 3/4, 18:30 UTC
We understand that customers rely on Application Insights as a critical service and apologize for any impact this incident caused.
Update: Friday, 04 March 2016 19:40 UTC
Root cause has been tentatively isolated to upgrades in one of our back end messaging systems which has now finished. Our customers will no longer see any data gaps. As residual impact, a few customers may observe multiple retries while sending data to the Application Insights endpoint. We are actively working on resolving these residual issues and will provide an update in another 6 hours about the progress made.
Next Update: Before 03/05 02:00 UTC
Initial Update: Friday, 04 March 2016 15:48 UTC
We are aware of issues within Application Insights and are actively investigating. Some customers may experience Data Gaps. The following data types are affected: Availability,Customer Event,Dependency,Exception,Metric,Page Load,Page View,Performance Counter,Request,Trace.
•Work Around: None
•Next Update: Before 03/04 20:00 UTC
We are working hard to resolve this issue and apologize for any inconvenience.