Principal Site Reliability Engineering Manager, Secret Cleared Environments

Microsoft

Quick summary

Work type: On-site
Location: Redmond, WA
Salary: $142,800–$274,800 / yr
Posted: 87 days ago
Closes: Sep 28, 2026

Market check

Salary context

Above market

How this pay compares to similar roles

Similar roles (midpoint): $189k
This role (midpoint): $209k
Chart range: $127k to $291k, with a shaded band marking where most similar roles pay

This role pays more than 66% of similar roles. Most pay $156,681–$222,000 — the shaded band above. At the midpoint, this role pays about $209k versus about $189k for comparable roles.

Based on 239 similar postings.

Employer

About Microsoft

Microsoft Corporation is a global technology leader producing software, hardware, and cloud services including Windows, Office 365, Azure cloud platform, Xbox gaming, and Surface devices.

Industry: Software & Cloud Computing

Microsoft currently has 694 open roles on FindRole.

Listed pay typically runs $119,800–$234,700 across 636 roles with salary data.

At a glance

TL;DR · Principal Site Reliability Engineering Manager, Secret Cleared Environments

As a Principal Site Reliability Engineering Manager at Microsoft Substrate, you will lead a team responsible for building and maintaining highly reliable cloud services in regulated environments. Your day-to-day involves developing senior engineers, ensuring operational excellence through robust telemetry and disciplined engineering practices, and embedding reliability early in the design phase. You will manage incident response, drive continuous improvement using SLOs and SLIs, and collaborate with security and compliance teams to deliver durable outcomes. The role requires expertise in software engineering fundamentals, automation, and AI-assisted techniques for operational excellence at scale. Ideal candidates have a strong background in cloud or distributed systems and experience working in regulated environments, along with the ability to obtain necessary government clearances such as Tier 3 and CJIS eligibility.

What you'll do

  • Lead and develop a team of Site Reliability Engineers to ensure operational excellence and reliability.
  • Own the operational health and reliability of Substrate services in regulated environments, ensuring high availability and compliance.
  • Drive incident management and post-incident reviews, focusing on systemic fixes and long-term resilience.
  • Serve as an actively engaged on-call engineer, leading incident response and driving durable engineering improvements.
  • Embed reliability, security, and compliance considerations early in service design and deployment decisions.
  • Build strong cross-functional relationships to deliver compliant and auditable reliability outcomes.

What we're looking for

  • Doctorate or Master's degree in Computer Science/IT plus relevant technical experience (2+ years for PhD, 3+ years for MS).
  • Experience leading and developing a team of Site Reliability Engineers.
  • Strong background in software engineering fundamentals and operational excellence.
  • Ability to obtain and maintain a Tier 3 background investigation for sensitive environments.
  • Proven track record operating services in regulated or compliance-sensitive environments.
  • Expertise in driving incident management, SLOs, SLIs, and operational metrics.

More like this

Similar roles

Principal Site Reliability Engineer

Microsoft

Redmond, WA 19 days ago $142,800–$274,800
Kubernetes Terraform Python Go Docker CI/CD Prometheus Grafana PostgreSQL Azure AWS Git Linux DevOps SLO Security Compliance GCC_Moderate GCC_High DoD CJIS Tier_3_Background_Investigation

Principal Software Engineering Manager, Substrate

Microsoft

WA 95 days ago $142,800–$274,800
Kubernetes Docker CI/CD Python Go PostgreSQL Azure AWS Terraform Git GitHub Jira Confluence Prometheus Grafana Security Compliance Incident Management DevOps MLOps

Principal Site Reliability Engineering Manager

Microsoft

71 days ago $142,800–$274,800
Azure Kubernetes Docker CI/CD Prometheus Grafana Python Go PostgreSQL Terraform AWS GitOps SLOs SLIs Observability MetricstoLogsTracing BlamelessPostIncidentReviews SelfHealingSystems SafeRollouts AutomatedRemediation

Site Reliability Engineer II

Microsoft

Redmond, WA +1 25 days ago $102,100–$202,200
Python Java Go C# CI/CD Terraform AWS Kubernetes Docker Prometheus Grafana PostgreSQL Linux Git Ansible Nginx SSL/TLS OAuth RESTful APIs JSON

Site Reliability Engineer

SpaceX

Hawthorne, CA 7 days ago $145,000–$175,000
Kubernetes Linux Python DevOps Site Reliability Engineering Virtualization Hypervisor technologies Performance optimization techniques