Microsoft Careers

Microsoft

Quick summary

Work type: On-site
Location:
Salary: $142,800–$274,800 / yr
Posted: 35 days ago

Market check

Salary context

Above market

How this pay compares to similar roles

[Chart: pay range for similar roles, from $101k to $293k, with a shaded band where most similar roles pay. Markers: similar roles $180k, this role $209k.]

This role pays more than 96% of similar roles. Most similar roles pay $152,150–$207,350 (the shaded band above). At the midpoint, this role pays about $209k, versus about $180k for comparable roles.

Based on 239 similar postings.
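
The midpoint comparison above can be reproduced with simple arithmetic. The sketch below is illustrative only. It assumes the "about $180k" figure is the midpoint of the $152,150–$207,350 band, which the listing does not state. The `midpoint` helper and the variable names are hypothetical.

```python
def midpoint(low: int, high: int) -> float:
    """Return the midpoint of a pay range."""
    return (low + high) / 2


# Ranges as quoted in this listing (USD per year).
role_low, role_high = 142_800, 274_800
# Assumption: the comparison figure is the midpoint of the typical band.
similar_low, similar_high = 152_150, 207_350

role_mid = midpoint(role_low, role_high)
similar_mid = midpoint(similar_low, similar_high)

# Relative difference between the two midpoints.
premium = (role_mid - similar_mid) / similar_mid

print(f"This role midpoint:     ${role_mid:,.0f}")
print(f"Similar roles midpoint: ${similar_mid:,.0f}")
print(f"Premium over similar:   {premium:.1%}")
```

Under these assumptions the midpoints come to $208,800 and $179,750, which round to the $209k and $180k shown above.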

Employer

About Microsoft

Microsoft Corporation is a global technology company that produces software, hardware, and cloud services, including Windows, Office 365, the Azure cloud platform, Xbox gaming, and Surface devices.

Industry: Software & Cloud Computing

Microsoft currently has 1,580 open roles on FindRole.

Listed pay typically runs $119,800–$234,700 across 1,408 roles with salary data.

At a glance

TL;DR

As a Senior Site Reliability Engineer on the Incident Response SRE team at Microsoft, you will help maintain and continuously improve the resilience of Substrate, which powers Microsoft 365. Day to day, you will lead high-severity incident response, debug complex issues, improve observability through telemetry and alerting, define service level indicators (SLIs) and service level objectives (SLOs), conduct live site health reviews, and turn what you learn into proactive tests and product fixes. You will also design reliability drills to validate resilience strategies and draft process documentation for incident management. The role requires 8+ years of experience in software engineering, systems administration, or network engineering, with a preference for experience in large-scale service operations and data-driven practices.

What you'll do

  • Lead high-severity incident response and debug complex issues to drive resolution.
  • Enhance telemetry, alerting, and dashboards to improve observability and reduce detection time.
  • Establish and track SLIs/SLOs for critical scenarios with engineering teams.
  • Translate business requirements into metrics and action during live site health reviews.
  • Design and execute reliability drills to validate resilience and recovery strategies.

What we're looking for

  • Extensive experience (8+ years) in software engineering, network engineering, or systems administration.
  • Proven ability to lead high-severity incident response and drive resolution with clear communication.
  • Expertise in enhancing observability through telemetry, alerting, and dashboards using One Microsoft tooling.
  • Strong skills in defining and measuring reliability metrics (SLIs/SLOs) for critical scenarios.
  • Experience translating business requirements into actionable metrics and driving live site health reviews.
  • Ability to design and execute resilience drills and translate learnings into proactive engineering solutions.

More like this

Similar roles

Microsoft

Redmond, WA 12 days ago $142,800–$274,800
Python MATLAB RF measurement time-domain control AI ML automation tools topological qubits spin qubits superconducting qubits quantum characterization verification validation data acquisition statistical analysis cryogenic electrical measurements

Microsoft

WA +1 66 days ago $119,800–$234,700
Microsoft Azure Kubernetes Terraform Python SQL PostgreSQL CI/CD Docker AWS Google Cloud Platform Project Management Scrum Agile DevOps Infrastructure as Code Quality Assurance Construction Management Vendor Management Contract Compliance Data Center Operations

Microsoft

Redmond, WA 59 days ago $139,900–$274,800
Azure AWS GCP PowerShell AzureCLI CI/CD Python Kubernetes Terraform Docker PostgreSQL Snowflake Git Jira Confluence GitHub Slack Zoom GoogleMeet Miro Asana Trello

Microsoft

Redmond, WA 61 days ago $86,100–$169,800
ATS SQL Python R PowerBI Google Analytics LinkedIn Slack Zoom Microsoft Office Service Level Agreements General Data Protection Regulation Office of Federal Compliance Programs

Microsoft

Redmond, WA 46 days ago $85,400–$168,100
Python Docker Kubernetes CI/CD DevOps C# C++ Java JavaScript TypeScript Distributed Systems Cloud Infrastructure Model Serving Caching Batching Monitoring