Showing results for Postmortem - Azure DevOps Service

Oct 16, 2018
0

Postmortem: Azure DevOps Service Outages in October 2018

Azure DevOps SRE
Azure DevOps SRE

Earlier this month, Azure DevOps experienced several significant service outages, for which we are deeply sorry. As with every significant live site incident, we have completed a detailed root cause analysis for these. Due to the proximity of these incidents and common underlying causes, we wanted to share the details with you to ensure that you kn...

Sep 24, 2018
0

Postmortem – VS Marketplace outage – 4 September 2018

Azure DevOps SRE
Azure DevOps SRE

On Tuesday, 4 September 2018, Visual Studio Marketplace suffered an extended outage affecting most of its customers. Marketplace hosts and serves extensions for the Visual Studio IDE, Visual Studio Code, and Azure DevOps. This was the first instance of the Marketplace service going down completely, and we sincerely apologize for the outage. What ha...

Sep 10, 2018
0

Postmortem: VSTS 4 September 2018

Azure DevOps SRE
Azure DevOps SRE

Postmortem – VSTS Outage – 4 September 2018 On Tuesday, 4 September 2018, VSTS (now called Azure DevOps) suffered an extended outage affecting customers with organizations hosted in the South Central US region (one of the 10 regions globally hosting VSTS customers). The outage also impacted customers globally due to cross-service dependencies. It r...

May 25, 2018
0

Postmortem: Global VSTS availability issues – 22 May 2018

Azure DevOps SRE
Azure DevOps SRE

Customer Impact: On 22 May 2018, Visual Studio Team Services (VSTS) experienced a major incident across multiple regions between 15:00 and 16:55 UTC.  An event in a Western European scale unit of the Team Foundation Service (TFS), caused a chain reaction that sporadically took other TFS scale units offline in other regions.  Based on our telemetry...

Feb 14, 2018
0

Preliminary Postmortem: Performance Issues and failures in VSTS West Europe – 7 February 2018

Azure DevOps SRE
Azure DevOps SRE

Edit February 26, 2018: We have just posted an updated and complete postmortem here: https://devblogs.microsoft.com/devopsservice/?p=16295 Customer Impact On 7 February 2018 we had an incident which impacted users in our Western European scale unit. During this time, users experienced slow performance and 503 errors (service unavailable) when i...

Dec 27, 2017
0

Postmortem – Intermittent Failures for Visual Studio Team Services on 14 Dec 2017

Azure DevOps SRE
Azure DevOps SRE

On 14 December 2017 we began to have a series of incidents with Visual Studio Team Services (VSTS) for several days that had a serious impact on the availability of our service for many customers (incident blogs #1 #2 #3). We apologize for the disruption these incidents had on you and your team. Below we describe the cause and the actions we are ta...

Dec 19, 2017
0

Postmortem – Availability issues with Visual Studio Team Services on 6 Dec 2017

Azure DevOps SRE
Azure DevOps SRE

On 6 December 2017 we had a global incident with Visual Studio Team Services (VSTS) that had a serious impact on the availability of our service (incident blog here). We apologize for the disruption. Below we describe the cause and the actions we are taking to address the issues. Customer Impact This was a global incident that caused performance...

Dec 18, 2017
0

Postmortem – Availability issues with Visual Studio Team Services on 21 November 2017

Azure DevOps SRE
Azure DevOps SRE

On 21 November 2017 we had an incident with Visual Studio Team Services (VSTS) that had a serious impact on the availability of our service for many customers (incident blog here). We apologize for the disruption. Below we describe the cause and the actions we are taking to address the issues. Customer Impact This incident caused performance issu...

Dec 13, 2017
0

Postmortem – Availability issues with Visual Studio Team Services on 16 November 2017

Azure DevOps SRE
Azure DevOps SRE

On 16 November 2017 we had a global incident with Visual Studio Team Services (VSTS) that had a serious impact on the availability of our service (https://blogs.msdn.microsoft.com/vsoservice/?p=15526). We apologize for the disruption. Below we describe the cause and the actions we are taking to address the issues. Customer Impact This was a globa...