Site Reliability Engineering - CTJ - Poly | Microsoft Careers

Microsoft

Actively hiring
US Posted 75 days ago $100,600–$199,000 / year

At a glance

AI generated

TL;DR

Join Microsoft’s Windows Cloud Experiences Sovereign team as a Site Reliability Engineer (IC3) and help deliver secure, high-quality remote work experiences through Windows 365 and Azure Virtual Desktop. You will work with cross-discipline engineers to maintain clients and infrastructure, with a focus on service reliability, better monitoring, and automation that reduces operational overhead. Day to day, you will troubleshoot issues during on-call rotations, deploy fixes, and take part in post-mortem analysis. You will manage production changes through the safe deployment process (SDP) and work with product engineering teams in code reviews and meetings to drive continuous improvement. The role requires expertise in cloud technologies and automation scripting, and a deep understanding of Windows operating systems at scale.

Skills

Azure, Kubernetes, Docker, CI/CD, Python, PostgreSQL, Prometheus, Grafana, Terraform, Ansible, GitOps, SRE, Monitoring, Alerting, Incident Management

What you'll do

  • Respond to incidents during on-call rotations, mitigate their impact, and deploy fixes.
  • Troubleshoot issues affecting availability, security, reliability, performance, and efficiency.
  • Manage production changes using existing tools and automation processes.
  • Contribute to product improvements by participating in code and design reviews and meetings.
  • Build an understanding of products at scale to improve availability, security, quality, and observability.
  • Propose improvements based on analysis of telemetry data.

What we're looking for

  • Minimum 5 years of experience in Site Reliability Engineering (SRE) or a related field.
  • Proven ability to troubleshoot and resolve complex technical issues in production environments.
  • Strong understanding of secure, reliable, and high-performance cloud infrastructure.
  • Experience with incident response, post-mortem analysis, and continuous improvement processes.
  • Proficiency in developing and maintaining automation scripts for operational tasks.
  • Knowledge of Windows 365 Cloud PC/Azure Virtual Desktop technologies and their management.
  • Ability to collaborate effectively across engineering teams and contribute to code/design reviews.

Market check

Salary context

Competitive pay

How this pay compares to similar roles

[Chart: pay scale from $89k to $212k. Similar roles: $170k. This role: $150k.]

This role pays less than 65% of similar roles. Most similar roles pay $142,212–$197,989. At the midpoint, this role pays about $150k versus about $170k for comparable roles.

Based on 240 similar postings.
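
The midpoint figures in the comparison can be reproduced from the two ranges quoted in this listing. The sketch below is illustrative only: it assumes the "about $170k" figure is the midpoint of the $142,212–$197,989 band, which the listing does not state, and the helper name midpoint is hypothetical.

```python
# Illustrative check of the midpoint figures quoted in the salary comparison.
# Both ranges are copied from the listing. Treating the "similar roles" figure
# as the midpoint of its band is an assumption, not something the listing states.

def midpoint(low: float, high: float) -> float:
    """Return the arithmetic midpoint of a pay range."""
    return (low + high) / 2

role_range = (100_600, 199_000)      # posted range for this role
similar_range = (142_212, 197_989)   # band where most similar roles pay

role_mid = midpoint(*role_range)
similar_mid = midpoint(*similar_range)
gap = similar_mid - role_mid

print(f"This role, midpoint:     ${role_mid:,.2f}")
print(f"Similar roles, midpoint: ${similar_mid:,.2f}")
print(f"Gap at the midpoint:     ${gap:,.2f}")
```

Rounded to the nearest thousand, the two midpoints match the $150k and $170k figures quoted in the comparison.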

Employer

About Microsoft

Microsoft Corporation is a global technology leader producing software, hardware, and cloud services including Windows, Office 365, Azure cloud platform, Xbox gaming, and Surface devices.

Industry: Software & Cloud Computing

Microsoft currently has 534 open roles on FindRole.

Listed pay typically runs $119,800–$234,700 across 488 roles with salary data.

More like this

Similar roles

| Microsoft Careers

Microsoft

US 70 days ago
Azure, Kubernetes, Docker, CI/CD, Python, Go, Terraform, Prometheus, Grafana, AI, ML, Telemetry, SDP, PostgreSQL, SQL, Git, Linux, Windows Server, DevOps, SRE, Cloud Security, Capacity Planning
Hybrid

Site Reliability Engineer - CTJ - POLY | Microsoft Careers

Microsoft

US 101 days ago $119,800–$234,700
Azure, Kubernetes, Ansible, CI/CD, GitHub Actions, Linux, Rocky 9, Red Hat, Mariner, Python, Go, Terraform, AWS, Prometheus, Grafana, Docker, SLIs/SLOs, Chaos Engineering, Infrastructure as Code, Telemetry, Observability, Metrics, Logs, Traces, Blameless Postmortems