HPC Operations Engineer
Microsoft
Quick summary
Market check
How this pay compares to similar roles
This role pays more than 77% of similar roles, most of which pay $148,500–$227,487. At the midpoint, this role pays about $231k versus about $188k for comparable roles.
Based on 240 similar postings.
Employer
Microsoft Corporation is a global technology leader producing software, hardware, and cloud services including Windows, Office 365, Azure cloud platform, Xbox gaming, and Surface devices.
Industry: Software & Cloud Computing
Microsoft currently has 728 open roles on FindRole.
Listed pay typically runs $119,800–$234,700 across 664 roles with salary data.
At a glance
As an experienced High Performance Computing Operations Engineering Manager on Microsoft AI’s SuperIntelligence Team, you will lead a team of Site Reliability Engineers to ensure the reliability and efficiency of large-scale distributed AI infrastructure. Your daily responsibilities include designing and maintaining monitoring systems for real-time visibility into model serving pipelines, building automation tools for hybrid cloud environments, managing incident response and conducting blameless postmortems, ensuring data privacy and compliance, and collaborating with ML engineers to improve developer experience. The role requires expertise in Kubernetes, Docker, Python, Go, Bash, and public cloud platforms like Azure, AWS, or GCP, along with a solid understanding of distributed systems, networking, and storage. This startup-like team focuses on advancing humanist superintelligence, aiming for breakthroughs that benefit society through ultra-capable AI systems anchored to human values.