| Microsoft Careers

Microsoft

Quick summary

Work type
On-site
Location
Salary
$142,800–$274,800 / yr
Posted
59 days ago

Market check

Salary context

Above market

How this pay compares to similar roles

Similar $180k
This role $209k
$101k most similar roles pay here $293k

This role pays more than 95% of similar roles. Most pay $152,150–$207,350 — the shaded band above. At the midpoint, this role pays about $209k versus about $180k for comparable roles.

Based on 239 similar postings.

Employer

About Microsoft

Microsoft Corporation is a global technology leader producing software, hardware, and cloud services including Windows, Office 365, Azure cloud platform, Xbox gaming, and Surface devices. Industry: Software & Cloud Computing

Microsoft currently has 1580 open roles on FindRole.

Listed pay typically runs $119,800–$234,700 across 1408 roles with salary data.

Most-posted roles

View all roles at Microsoft

At a glance

TL;DR · | Microsoft Careers

As a Principal Site Reliability Engineering Manager within Microsoft’s ES365 organization, you will lead a team of diverse SREs to enhance the reliability and operational proficiency of large-scale engineering systems used by teams building Office, Exchange, and Microsoft 365. Your day-to-day responsibilities include partnering with engineers and product managers to design and maintain reliable services, driving cross-organizational alignment through shared standards and tooling, and establishing service level objectives and indicators. You will also mentor team members on reliability engineering and incident response, automate processes to reduce operational overhead, and foster a culture of inclusivity and high performance. The role requires expertise in Azure cloud services, containerization, orchestration, and mature observability practices such as metrics, logs, and tracing, with a focus on improving engineers' productivity through scalable and reliable operations at enterprise scale.

What you'll do

  • Partner with teams to design and maintain reliable and resilient services.
  • Drive cross-organizational alignment through partnerships and co-development.
  • Build and retain a team of Site Reliability Engineers.
  • Define, implement, and operate SLOs/SLIs for critical engineering systems.
  • Lead incident management and conduct blameless post-incident reviews.
  • Drive automation to reduce operational toil and improve efficiency.
  • Establish observability practices to meet reliability and latency goals.

What we're looking for

  • 5+ years of experience leading large-scale initiatives involving multiple engineers.
  • Proven expertise in reliability engineering for developer-facing or platform services.
  • Strong background in incident response, automation, and observability practices.
  • Experience with enterprise-scale distributed cloud service architecture and deployment.
  • Deep knowledge of operating CI/CD, build, and release platforms reliably and safely.
  • Ability to work across disciplines and align teams on reliability priorities.
  • Skilled in architecting and managing containerized services and orchestration tools.

More like this

Similar roles

| Microsoft Careers

Microsoft

US 63 days ago $142,800$274,800
Python JavaScript C++ Java Kubernetes AWS Azure Docker CI/CD PostgreSQL MongoDB Redis Apache Spark TensorFlow PyTorch Prometheus Grafana Git Jenkins Responsible AI Scikit-learn
Hybrid

| Microsoft Careers

Microsoft

Redmond, WA 12 days ago $142,800$274,800
Python MATLAB RF measurement time-domain control AI ML automation tools topological qubits spin qubits superconducting qubits quantum characterization verification validation data acquisition statistical analysis cryogenic electrical measurements

| Microsoft Careers

Microsoft

Redmond, WA 51 days ago $142,800$274,800
Python TensorFlow PyTorch Kubernetes Docker CI/CD PostgreSQL MongoDB AWS Azure NLP Multimodal_Models Fine_Tuning Reinforcement_Learning A/B_Testing Predictive_Analytics Statistical_Methodologies ACL EMNLP SIGKDD AAAI WSDM COLING WWW ICASSP

| Microsoft Careers

Microsoft

Redmond, WA 53 days ago $127,600$229,200
UPS Generator AHU Servers SANs Networking Rack/Enclosures Structured Cabling CompTIA ITIL v3 Foundation MOF Certifications PMP CDCP CCNA Certifications ASICS/Inventory Control Leadership Development Certificates

| Microsoft Careers

Microsoft

Redmond, WA 55 days ago
CUDA GPU ROCm Triton PTX CUTLASS C++ Parallel Computing Algorithm Optimization Performance Profiling Memory Hierarchies Deep Learning Model Compression Accelerator Design Machine Learning Systems Research

| Microsoft Careers

Microsoft

Redmond, WA 54 days ago
inventory management systems configuration management databases asset management repositories RMA portals CI/CD Terraform AWS Kubernetes Docker Prometheus Grafana Python SQL PostgreSQL Git Jira Confluence