Experiencing Alerting failure for Metric Data Type – 8/19 – Resolved


Final Update: Thursday, 8/20/2015 05:39 UTC

We’ve confirmed that all systems are back to normal with no customer impact as of 8/20, 05:40 UTC. Our logs show that the incident started on 8/10, 19:00 UTC and that during the duration of the incident, a subset of our customers would have faced issues with the App Insights alerting feature based on the metrics data type.

Root Cause: The failure was due to a wrongly configured service in a new sub section in our data processing pipeline.
Lessons Learned: This incident has helped us identify gaps in monitoring for the new features that we introduce in our service. We remain all the more committed to fix these gaps and prevent such incidents in the future.
Incident Timeline: 10 days and 9 hours - 8/10, 19:00 UTC through 8/20, 05:40 UTC

We understand that customers rely on Application Insights as a critical service and apologize for any impact this incident caused.

-Application Insights Service Delivery Team


Update: Thursday, 8/20/2015 01:39 UTC

Our DevOps team continues to investigate issues within Application Insights. Root cause is not fully understood at this time. Some customers continue to experience alerting failures ( no alerting) for rules setup based on metrics data type. We are working to establish the start time for the issue. We currently have no estimate for resolution.

Next Update: Before 8/20 03:45 UTC

-Application Insights Service Delivery Team


Initial Update: Wednesday, 8/19/2015 23:02 UTC

We are aware of issues within Application Insights and are actively investigating. Some customers may experience Alerting failure for alerting setup based on the Metric data type.

Next Update: Before 00:15 8/20 UTC

We are working hard to resolve this issue and apologize for any inconvenience.

-Application Insights Service Delivery Team

 

 


Skip to main content