| Microsoft Careers

Microsoft

Quick summary

Work type
On-site
Location
Salary
$142,800–$274,800 / yr
Posted
8 days ago
Closes
Nov 24, 2026

Market check

Salary context

Above market

How this pay compares to similar roles

Similar $180k
This role $209k
$106k most similar roles pay here $293k

This role pays more than 76% of similar roles. Most pay $152,150–$208,800 — the shaded band above. At the midpoint, this role pays about $209k versus about $180k for comparable roles.

Based on 239 similar postings.

Employer

About Microsoft

Microsoft Corporation is a global technology leader producing software, hardware, and cloud services including Windows, Office 365, Azure cloud platform, Xbox gaming, and Surface devices. Industry: Software & Cloud Computing

Microsoft currently has 728 open roles on FindRole.

Listed pay typically runs $119,800–$234,700 across 664 roles with salary data.

Most-posted roles

View all roles at Microsoft

At a glance

TL;DR · | Microsoft Careers

As a Principal Site Reliability Engineer in Microsoft’s Incident Response SRE team, you will play a pivotal role in maintaining the resilience and reliability of Substrate and MSAI services. Your responsibilities include leading high-severity incident responses, enhancing observability through telemetry and alerting systems, defining service level indicators and objectives, conducting live site health reviews, and translating learnings into proactive engineering practices to prevent future incidents. You will also design and execute reliability drills to validate resilience strategies and develop process documentation for incident management. This role requires expertise in software engineering, network engineering, or systems administration, with a preference for experience in large-scale cloud or distributed systems. The team operates at the cutting edge of global service health management, ensuring that Microsoft 365 remains resilient and continuously improving through data-driven practices and automation.

What you'll do

  • Lead high-severity incident response and drive incidents to resolution with clear communication.
  • Enhance telemetry, alerting, and dashboards using One Microsoft tooling to improve observability.
  • Establish and track SLIs/SLOs for critical scenarios in partnership with engineering teams.
  • Translate business requirements into metrics and action during live site health reviews.
  • Design and execute drills simulating product failures to validate resilience and recovery strategies.

What we're looking for

  • Doctorate, Master's, or Bachelor's degree in Computer Science, Information Technology, or related field.
  • 3+ years of technical experience in software engineering, network engineering, or systems administration.
  • Experience leading high-severity incident response and driving systemic improvements.
  • Strong skills in enhancing observability through telemetry, alerting, and dashboards.
  • Ability to define and measure reliability using SLIs/SLOs for critical scenarios.
  • 7+ years of experience working with large-scale cloud or distributed systems preferred.

More like this

Similar roles

| Microsoft Careers

Microsoft

Redmond, WA 57 days ago $119,800$234,700
Azure Python Java Scala Spark Hadoop HDFS Kafka Flink Docker Kubernetes CI/CD PostgreSQL Redis Elasticsearch Prometheus Grafana Git Jenkins
Hybrid

| Microsoft Careers

Microsoft

Redmond, WA 52 days ago $142,800$274,800
Azure Kubernetes Docker CI/CD Python PostgreSQL Terraform Prometheus Grafana Git Jira Swagger RESTful APIs JSON YAML DevOps Scrum Agile
Hybrid

| Microsoft Careers

Microsoft

Mountain View, CA 53 days ago $142,800$274,800
Python Java JavaScript C# Azure AWS GCP Docker Kubernetes CI/CD PostgreSQL MSSQL
Hybrid

| Microsoft Careers

Microsoft

US 53 days ago $142,800$274,800
Python JavaScript C++ Java Kubernetes AWS Azure Docker CI/CD PostgreSQL MongoDB Redis Apache Spark TensorFlow PyTorch Prometheus Grafana Git Jenkins Responsible AI Scikit-learn
Hybrid

| Microsoft Careers

Microsoft

US 178 days ago $119,800$234,700
Python Pandas NumPy Spark Ray Apache_Beam Azure PostgreSQL Kubernetes Docker CI/CD Git Jupyter_Notebook TensorFlow PyTorch Hugging_Face GitHub Visual_Studio_Code Prometheus Grafana

| Microsoft Careers

Microsoft

US 46 days ago
Azure Python C# JavaScript R Terraform Bicep Azure Functions Docker API Management Azure Cognitive Services Azure OpenAI Azure AI Search Vector Indexes Azure Document Processing Infrastructure as Code CI/CD