We've confirmed that all systems are back to normal with no customer impact as of 01/25, 12:55 UTC. Our logs show that the incident started on 01/25, 12:30 UTC, and that during the 25 minutes it took to resolve the issue, 6% of customers experienced data access issues in the Azure portal and the Application Insights API.
- Root Cause: The failure was due to a change in query pattern, introduced to provide additional metadata details, which impacted the performance of our underlying platform component. We have reverted the change for now and will introduce a new data indexing feature in a few days to provide the same functionality.
- Lessons Learned: We have worked with our partner teams to understand the desired query pattern and have taken appropriate steps on our side to honour it. We will also be working on a detailed postmortem of this issue to avoid similar disruptions in the future.
- Incident Timeline: 25 minutes - 01/25, 12:30 UTC through 01/25, 12:55 UTC
We understand that customers rely on Application Insights as a critical service, and we apologize for any impact that the data access issues of the last few days may have caused.
We are aware of issues within Application Insights and are actively investigating. Some customers may experience data access issues in the Azure portal. The following data types are affected: Availability, Customer Event, Dependency, Exception, Metric, Page Load, Page View, Performance Counter, Request, Trace.
- Workaround: None
- Next Update: Before 01/25 15:30 UTC
We are working hard to resolve this issue and apologize for any inconvenience.