Experiencing Data Access Issue in Azure Portal for Many Data Types - 11/11 - Resolved


Final Update: Thursday, 12 November 2015 04:31 UTC

We've confirmed that all systems are back to normal with no customer impact as of 11/12, 04:00 UTC. Our logs show the incident started on 11/11, 16:50 UTC and that during the 11 hours that it took to resolve the issue some of our  customers experienced some kind of data latency.

  • Root Cause: Full RCA pending.
  • Incident Timeline: 11 Hours & 10 minutes - 11/11, 16:50 UTC through 11/12, 04:00 UTC

We understand that customers rely on Application Insights as a critical service and apologize for any impact this incident caused.

-Application Insights Service Delivery Team


Update: Thursday, 12 November 2015 03:01 UTC

Recovery is taking longer than expected. We have processed most of the data and continue to monitor the recovery,
We apologize for any inconvenience caused.

  • Work Around: None
  • Next Update: Before 11/12 06:30 UTC

-Application Insights Service Delivery Team


Update: Wednesday, 11 November 2015 23:44 UTC

We see that processing for backlog data is continuing as expected. We are halfway through the processing and it's expected to take another 3 hours to process the rest of the backlog data. We will continue to monitor for complete recovery.

  • Work Around: None
  • Next Update: Before 11/12 03:00 UTC

-Application Insights Service Delivery Team


Update: Wednesday, 11 November 2015 21:29 UTC

The Azure networking interruption has been mitigated by our Azure partner team.  All data is current since 19:10 UTC and the entire service is again working as expected.  Some customers may experience data latency for all data types between approximately 17:15 UTC and 19:10 UTC while we process through the backlog of data.

Based on current backlog processing velocity, we expect that all backlogged data will be current within approximately 5 hours.

  • Work Around: none
  • Next Update: Before 11/11 23:30 UTC

-Application Insights Service Delivery Team


Update: Wednesday, 11 November 2015 19:19 UTC

Our DevOps team continues to investigate issues within Application Insights. Root cause is understood to be impact from an Azure network interruption. We continue to work with our Azure partners who have applied some potential mitigations.

We are seeing some level of recovery in our service health monitors, but need to continue to assess progress towards recovery.  Some customers may continue to experience intermittent UX errors and should expect their data to be latent due to backlog in our pipeline.

We are working to establish the start time for the issue, initial findings indicate that the problem began at 11/11, ~16:52 UTC.  We currently have no estimate for resolution.

  • Work Around: none
  • Next Update: Before 11/11 21:30 UTC

-Application Insights Service Delivery Team


Initial Update: Wednesday, 11 November 2015 17:26 UTC

We are aware of issues within Application Insights and are actively investigating. Some customers may experience Data Access Issue in Azure Portal. The following data types are affected: Availability,Customer Event,Dependency,Exception,Metric,Page Load,Page View,Performance Counter,Request,Trace.
  • Work Around:
  • Next Update: Before 11/11 19:30 UTC

We are working hard to resolve this issue and apologize for any inconvenience.
-Application Insights Service Delivery Team


Skip to main content