Principal Manager, Incident Management - AMER | Microsoft Careers

Microsoft

Quick summary

Work type
On-site
Location
Redmond, WA
Salary
$142,800–$274,800 / yr
Posted
2 days ago
Closes
Dec 15, 2026

Market check

Salary context

Above market

How this pay compares to similar roles

Similar $192k
This role $209k
$127k most similar roles pay here $291k

This role pays more than 70% of similar roles. Most pay $170,000–$214,900 — the shaded band above. At the midpoint, this role pays about $209k versus about $192k for comparable roles.

Based on 239 similar postings.

Employer

About Microsoft

Microsoft Corporation is a global technology leader producing software, hardware, and cloud services including Windows, Office 365, Azure cloud platform, Xbox gaming, and Surface devices. Industry: Software & Cloud Computing

Microsoft currently has 1633 open roles on FindRole.

Listed pay typically runs $119,800–$234,700 across 1454 roles with salary data.

Most-posted roles

View all roles at Microsoft

At a glance

TL;DR · Principal Manager, Incident Management - AMER | Microsoft Careers

The Principal Manager, Incident Management - AMER at Microsoft Cloud Infrastructure and Operations (CO+I) leads end-to-end incident management and crisis response for global data centers, orchestrating multi-team mitigation efforts to ensure rapid restoration and clear communication with stakeholders. This strategic role drives service reliability by improving Time to Detect (TTD) and Time to Mitigate (TTM), embedding best practices in incident management, and advancing telemetry and automation capabilities. The incumbent builds cross-organizational partnerships, develops high-performing teams, and fosters a culture of accountability and continuous improvement. Required qualifications include a Bachelor's degree and 6+ years of technical experience in critical environments or equivalent expertise, with preference for advanced degrees, extensive cloud system experience, and knowledge of BAS, BMS, and EPMS systems. This role supports Microsoft’s global datacenters and online businesses, ensuring reliability for over 1 billion customers worldwide.

What you'll do

  • Lead end-to-end incident management, orchestrating multi-team efforts to restore services quickly and communicate effectively with stakeholders.
  • Drive service reliability by improving Time to Detect (TTD) and Time to Mitigate (TTM), ensuring teams meet Service Level Objectives (SLOs).
  • Define and execute a reliability engineering strategy to enhance telemetry, alerting, automation, and predictive monitoring for proactive issue identification.
  • Build cross-organizational partnerships to standardize incident response processes and enable consistent high-quality service across regions.
  • Develop high-performing teams by fostering accountability, continuous improvement, and inclusion to deliver measurable customer impact.

What we're looking for

  • Bachelor's degree in relevant engineering or IT field and 6+ years of technical experience.
  • Master's degree in a related field with 8+ years of experience, or 12+ years of equivalent experience.
  • Experience in critical environment, network, service, systems engineering, or industrial controls.
  • Expertise in BAS, BMS, and EPMS systems.
  • Technical knowledge working with large-scale cloud or distributed systems.
  • Ability to read single line diagrams and diagnose faults from drawings.

More like this

Similar roles

| Microsoft Careers

Microsoft

US 64 days ago $142,800$274,800
CI/CD GitHub Azure DevOps Kubernetes Docker Python JavaScript PostgreSQL MongoDB AWS Azure Terraform Git Jenkins Prometheus Grafana DevOps AI Machine Learning Security Compliance Privacy Reliability

| Microsoft Careers

Microsoft

Redmond, WA 75 days ago $119,800$234,700
Azure Python SQL PySpark Azure Data Lake Apache Spark Azure Synapse Azure Data Explorer CI/CD Kubernetes Terraform Docker Prometheus Grafana PostgreSQL Data Governance Data Lineage Data Security AI Machine Learning

| Microsoft Careers

Microsoft

Redmond, WA 71 days ago
Microsoft Azure Kubernetes Terraform Python SQL PostgreSQL Docker CI/CD Prometheus Grafana Ansible Git JSON XML Task Hazard Analysis (THA) Method Statement of Work (MSOW) Root Cause Analysis (RCA)

| Microsoft Careers

Microsoft

Redmond, WA 60 days ago
SQL Azure SQL Database Power BI Excel Dynamics 365 ACM SIGMOD VLDB IEEE ICDE Relational database systems Distributed cache Key-value stores Vector databases Storage engines Model training and inference Big data analytics systems Query processing Query optimization Concurrency Transactions Resource management Scheduling Data cleaning Data transformation AI ML

| Microsoft Careers

Microsoft

Redmond, WA +2 65 days ago $119,800$234,700
Python C++ CUDA Docker Kubernetes Terraform CI/CD Prometheus Grafana PostgreSQL Redis Git Jenkins AWS Azure Google Cloud Platform Scikit-learn TensorFlow PyTorch

| Microsoft Careers

Microsoft

Redmond, WA 64 days ago $100,600$199,000
Python SQL MachineLearning StatisticalModeling A/BTesting Dashboards Metrics CloudServices CI/CD Docker Kubernetes Terraform Azure AWS PostgreSQL MSSQL