Monitoring Issue showing data latency for Many Data Types – 01/30 – Resolved

Final Update: Saturday, 30 January 2016 08:26 UTC
We've confirmed that all systems are back to normal with no customer impact as of 01/30, 08:26 UTC. Our logs show the incident started on 01/30, 06:18 UTC, and that during the impact window some customers may have seen a latency banner even though there was no latency in the pipeline. We have mitigated the issue in the service responsible for this banner. A permanent fix has yet to be deployed to the service and may take 1-2 weeks to implement.
During the impact window there was no real issue with the service itself. This is the last communication for this incident. If the issue recurs, we will post a new communication to notify customers.
  • Root Cause: A failure in our monitoring telemetry service produced monitoring noise, which caused data latency to be reported incorrectly.
  • Lessons Learned: We have worked out a long-term mitigation plan that, once deployed, will prevent such issues from recurring.
  • Incident Timeline: 2 Hours & 08 minutes - 01/30, 06:18 UTC through 01/30, 08:26 UTC

We understand that customers rely on Application Insights as a critical service and apologize for any impact this incident caused.

-Application Insights Service Delivery Team

Initial Update: Saturday, 30 January 2016 06:18 UTC

We are aware of issues within Application Insights and are actively investigating. Our investigation found that our monitoring telemetry is currently having issues, which is causing Data Latency to be reported incorrectly to customers for multiple data types. We have confirmed that all our systems are currently functioning normally. We are evaluating options to fix the telemetry service, which will mitigate this monitoring issue.

  • Workaround: None needed; there is no customer impact.
  • Next Update: Before 01/30 12:30 UTC

We are working hard to resolve this issue and apologize for any inconvenience.
-Application Insights Service Delivery Team
