Final Update: Monday, 6/22/2015 14:23 UTC
We’ve confirmed that all systems are back to normal with no customer impact as of 06/22, 13:56 UTC. Our logs show the incident started on 06/22, 9:49 UTC and that during the 4 hours that it took to resolve the issue ~10% of customers experienced false failures in availability data or data gaps in availability data, if customers have their availability tests configured to run from Amsterdam location.
• Root Cause: The failure was due to failure in network device in Amsterdam datacenter.
• Lessons Learned: Network team confirmed that the bug was introduces during latest software upgrade for network devices, which is causing this issue. Network team will roll back the upgrade and will change escalation procedure to get engaged faster if such issue occurs again.
• Incident Timeline: 4 Hours & 7 minutes - 06/22, 9:49 UTC through 06/22, 9:49 UTC.
We understand that customers rely on Application Insights as a critical service and apologize for any impact this incident caused.
-Application Insights Service Delivery Team