Principal Site Reliability Engineer

Microsoft

Quick summary

Work type
On-site
Location
Redmond, WA
Salary
$142,800–$274,800 / yr
Posted
19 days ago
Closes
Dec 5, 2026

Market check

Salary context

How this pay compares to similar roles

Similar $184k
$134k most similar roles pay here $229k

This listing doesn't post a salary. Most similar roles pay $152,150–$215,500.

Based on 240 similar postings.

Employer

About Microsoft

Microsoft Corporation is a global technology leader producing software, hardware, and cloud services including Windows, Office 365, Azure cloud platform, Xbox gaming, and Surface devices. Industry: Software & Cloud Computing

Microsoft currently has 622 open roles on FindRole.

Listed pay typically runs $119,800–$234,700 across 571 roles with salary data.

Most-posted roles

View all roles at Microsoft

At a glance

TL;DR · Principal Site Reliability Engineer

As a Principal Site Reliability Engineer at Microsoft Substrate, you will play a pivotal role in setting technical and operational direction for reliability across critical cloud services like Exchange Online and M365 Copilot. Your responsibilities include defining reliability strategies, leading complex incident responses, and driving architectural decisions that ensure high availability and security in regulated environments such as GCC Moderate, GCC High, and DoD. You will architect large-scale automation and observability solutions, mentor senior engineers, and represent SRE perspectives to leadership. The ideal candidate has extensive experience with cloud or distributed systems, particularly in compliance-sensitive settings, and possesses strong skills in software engineering, network engineering, and system administration. This role demands a Tier 3 background investigation for GCCH and DoD environments and CJIS eligibility for GCC Moderate access, ensuring you can navigate the complexities of government cloud requirements while driving technical excellence across Microsoft’s foundational services.

What you'll do

  • Define and drive reliability strategy and SLO frameworks for Substrate workloads in regulated environments.
  • Lead incident response and provide technical direction during complex high-impact incidents.
  • Architect large-scale automation, observability, and self-healing solutions for Substrate services.
  • Influence architectural decisions to ensure intrinsic reliability, security, and compliance.
  • Mentor senior engineers and shape the long-term technical direction of SRE discipline.
  • Drive post-incident reviews resulting in systemic engineering improvements across teams.
  • Represent Substrate SRE perspectives with senior leadership and cross-functional partners.

What we're looking for

  • Must obtain and maintain appropriate background investigations and customer screenings for Microsoft Government cloud environments.
  • Requires experience working with large-scale cloud or distributed systems in regulated environments.
  • Expected to define reliability strategy, SLO frameworks, and operational best practices for critical services.
  • Serve as an actively engaged senior on-call engineer, leading incident response during complex incidents.
  • Mentor senior engineers and shape the long-term technical direction of Site Reliability Engineering.

More like this

Similar roles

Principal Site Reliability Engineering Manager

Microsoft

71 days ago $142,800$274,800
Azure Kubernetes Docker CI/CD Prometheus Grafana Python Go PostgreSQL Terraform AWS GitOps SLOs SLIs Observability MetricstoLogsTracing BlamelessPostIncidentReviews SelfHealingSystems SafeRollouts AutomatedRemediation

Principal Software Engineering Manager, Substrate

Microsoft

WA 95 days ago $142,800$274,800
Kubernetes Docker CI/CD Python Go PostgreSQL Azure AWS Terraform Git GitHub Jira Confluence Prometheus Grafana Security Compliance Incident Management DevOps MLOps

Site Reliability Engineer II

Microsoft

Redmond, WA +1 25 days ago $102,100$202,200
Python Java Go C# CI/CD Terraform AWS Kubernetes Docker Prometheus Grafana PostgreSQL Linux Git Ansible Nginx SSL/TLS OAuth RESTful APIs JSON

Site Reliability Engineer

Microsoft

Redmond, WA +1 3 days ago $119,800$234,700
Azure Terraform Kubernetes Docker PowerShell Python Bash ARM templates Azure Bicep Spark Hadoop CI/CD PostgreSQL Git Azure Container Apps AKS ACI Event Hubs Synapse

Site Reliability Engineer

Microsoft

US 31 days ago $102,100$202,200
Python JavaScript Docker Kubernetes Terraform Azure CI/CD PostgreSQL SQL Prometheus Grafana Git RESTful APIs OAuth SAML Zero-Touch Deployment M365 Services Exchange Online Protection Microsoft Defender for Office