Member of Technical Staff, High Performance Computing Engineer - MAI SuperIntelligence Team

Microsoft

Quick summary

Work type: Hybrid
Location: Mountain View, CA
Salary: $139,900–$274,800 / yr
Posted: 113 days ago
Closes: Aug 10, 2026

Market check

Salary context

Above market

How this pay compares to similar roles

[Chart: pay scale from $124k to $291k. Similar roles: $193k. This role: $207k. A shaded band marks where most similar roles pay.]

This role pays more than 68% of similar roles. Most pay $165,000–$221,725 (the shaded band above). At the midpoint of its listed range, this role pays about $207k, versus about $193k for comparable roles.

Based on 240 similar postings.

Employer

About Microsoft

Microsoft Corporation is a global technology leader producing software, hardware, and cloud services, including Windows, Office 365, the Azure cloud platform, Xbox gaming, and Surface devices.

Industry: Software & Cloud Computing

Microsoft currently has 571 open roles on FindRole.

Listed pay typically runs $119,800–$234,700 across 522 roles with salary data.

At a glance

TL;DR · Member of Technical Staff, High Performance Computing Engineer - MAI SuperIntelligence Team

Microsoft AI is hiring a Member of Technical Staff, High Performance Computing Engineer, to join the team that scales the infrastructure used to train advanced models like Copilot. The role involves designing, operating, and maintaining large-scale HPC environments, managing schedulers such as SLURM and Kubernetes, and keeping job scheduling efficient on massive clusters. Engineers develop automation tools in Bash or Python, work with researchers to support complex workloads, and troubleshoot issues independently in a fast-paced environment. The ideal candidate has extensive experience deploying high-performance clusters on-premises or in the cloud, working with SLURM and Kubernetes, and building scalable services on public clouds such as Azure, AWS, or GCP.

What you'll do

  • Design and maintain large-scale HPC environments for training advanced AI models.
  • Ensure reliable job scheduling by deploying and configuring HPC schedulers at scale.
  • Serve as a technical expert for core HPC domains like GPU compute or high-performance storage.
  • Develop automation tools using Bash and Python to enhance cluster reliability and efficiency.
  • Troubleshoot complex cluster usage issues and tune the performance of massive clusters.

What we're looking for

  • Bachelor’s degree in computer science or a related field and 4+ years of experience deploying or operating high-performance clusters.
  • Extensive hands-on experience with high-scale training clusters, using tools like SLURM, Kubernetes, and Ray, as well as NVIDIA InfiniBand clusters.
  • Proven track record of building scalable services on public cloud infrastructure such as Azure, AWS, or GCP.
  • Experience designing, maintaining, and troubleshooting large-scale HPC environments in production settings.
  • Strong automation skills with Bash and/or Python for improving cluster reliability and operational efficiency.
