| Microsoft Careers

Microsoft

Quick summary

Work type
On-site
Location
Salary
$119,800–$234,700 / yr
Posted
53 days ago

Market check

Salary context

Competitive pay

How this pay compares to similar roles

Similar $180k
This role $177k
$106k most similar roles pay here $248k

This role pays more than 64% of similar roles. Most pay $152,150–$207,350 — the shaded band above. At the midpoint, this role pays about $177k versus about $180k for comparable roles.

Based on 239 similar postings.

Employer

About Microsoft

Microsoft Corporation is a global technology leader producing software, hardware, and cloud services including Windows, Office 365, Azure cloud platform, Xbox gaming, and Surface devices. Industry: Software & Cloud Computing

Microsoft currently has 1577 open roles on FindRole.

Listed pay typically runs $119,800–$234,700 across 1405 roles with salary data.

Most-posted roles

View all roles at Microsoft

At a glance

TL;DR · | Microsoft Careers

As a Senior Site Reliability Engineer on the Incident Response SRE team at Microsoft, you will play a pivotal role in maintaining the resilience and reliability of Substrate and MSAI by preventing outages and swiftly resolving incidents. Your day-to-day responsibilities include leading high-severity incident responses, debugging complex issues, enhancing observability through telemetry and alerting tools, defining service level indicators (SLIs) and objectives (SLOs), and conducting live site health reviews to translate business requirements into actionable metrics. You will also draft policies for incident management and execute reliability drills to validate recovery strategies. This role requires expertise in software engineering, systems administration, and network engineering, with a preference for experience in cloud-scale services and familiarity with tools like One Microsoft.

What you'll do

  • Lead high-severity incident response and debug complex issues to drive rapid resolution.
  • Enhance telemetry, alerting, and dashboards using One Microsoft tooling for actionable insights.
  • Establish and track SLIs/SLOs with engineering teams for critical scenarios.
  • Translate business requirements into metrics during live site health review meetings.
  • Design and execute reliability drills to validate resilience and recovery strategies.
  • Draft process and policy documentation for incident preparation, response, and prevention.

What we're looking for

  • Extensive experience (8+ years) in software engineering, network engineering, or systems administration.
  • Proven ability to lead high-severity incident response and drive rapid resolution.
  • Strong skills in enhancing observability through telemetry, alerting, and dashboards.
  • Experience defining and measuring service reliability with SLIs/SLOs.
  • Capability to translate business requirements into metrics and actionable insights.
  • Expertise in designing and executing resilience drills and strategies.
  • Proficiency in drafting process and policy documentation for incident management.

More like this

Similar roles

| Microsoft Careers

Microsoft

Redmond, WA 13 days ago $142,800$274,800
Python MATLAB RF measurement time-domain control AI ML automation tools topological qubits spin qubits superconducting qubits quantum characterization verification validation data acquisition statistical analysis cryogenic electrical measurements

| Microsoft Careers

Microsoft

WA +1 67 days ago $119,800$234,700
Microsoft Azure Kubernetes Terraform Python SQL PostgreSQL CI/CD Docker AWS Google Cloud Platform Project Management Scrum Agile DevOps Infrastructure as Code Quality Assurance Construction Management Vendor Management Contract Compliance Data Center Operations

| Microsoft Careers

Microsoft

Redmond, WA 60 days ago $139,900$274,800
Azure AWS GCP PowerShell AzureCLI CI/CD Python Kubernetes Terraform Docker PostgreSQL Snowflake Git Jira Confluence GitHub Slack Zoom GoogleMeet Miro Asana Trello

| Microsoft Careers

Microsoft

Redmond, WA 62 days ago $86,100$169,800
ATS SQL Python R PowerBI Google Analytics LinkedIn Slack Zoom Microsoft Office Service Level Agreements General Data Protection Regulation Office of Federal Compliance Programs

| Microsoft Careers

Microsoft

Redmond, WA 47 days ago $85,400$168,100
Python Docker Kubernetes CI/CD DevOps C# C++ Java JavaScript TypeScript Distributed Systems Cloud Infrastructure Model Serving Caching Batching Monitoring