Microsoft Careers

Microsoft

Quick summary

Work type: On-site
Location: Redmond, WA
Salary: $139,900–$274,800 / yr
Posted: 85 days ago
Closes: Sep 19, 2026

Market check

Salary context

Competitive pay

How this pay compares to similar roles

Pay comparison chart: similar roles $193k, this role $207k, on a scale from $124k to $291k.

This role pays more than 56% of similar roles, most of which pay $177,250–$208,800. At the midpoint, this role pays about $207k versus about $193k for comparable roles.

Based on 239 similar postings.

Employer

About Microsoft

Microsoft Corporation is a global technology leader producing software, hardware, and cloud services including Windows, Office 365, Azure cloud platform, Xbox gaming, and Surface devices.

Industry: Software & Cloud Computing

Microsoft currently has 1577 open roles on FindRole.

Listed pay typically runs $119,800–$234,700 across 1405 roles with salary data.

At a glance

TL;DR

Join Microsoft’s AI Core team as a senior systems engineer building high-performance runtime systems in C++ for large-scale LLM inferencing. You will design and implement microservices and runtime components that optimize AI inferencing systems for latency, throughput, cost, and reliability at scale. Responsibilities include debugging complex production issues, integrating model inference pipelines into scalable infrastructure, and driving innovations in real-time and batch inferencing efficiency. The role requires 6+ years of systems programming experience with C++, a proven track record of building and operating scalable cloud services, strong debugging skills, and hands-on experience with distributed systems, Kubernetes, and CUDA for large-scale LLM infrastructure. Preferred candidates have additional experience optimizing AI model inference stacks and working on Azure OpenAI or similar platforms.

What you'll do

  • Design and implement high-performance microservices and runtime components in C++.
  • Optimize AI inferencing systems for latency, throughput, cost, and reliability at scale.
  • Debug and resolve complex production issues related to performance, scaling, and service reliability.
  • Contribute to state-of-the-art multimodal inferencing systems supporting text, speech, and vision workloads.
  • Drive systems-level innovations for real-time and batch inferencing efficiency.

What we're looking for

  • 6+ years of systems programming experience with strong C++ expertise.
  • Proven track record in building, deploying, and operating scalable cloud services.
  • Expertise in debugging complex issues using performance profiling tools.
  • Hands-on experience with distributed systems, Kubernetes, and containerized workloads.
  • Experience optimizing large-scale LLM inferencing infrastructure, including CUDA.

More like this

Similar roles

Principal Software Engineer, CoreAI | Microsoft Careers

Microsoft

US · 95 days ago · $142,800–$274,800
Python, C++, Java, JavaScript, Azure, CI/CD, Kubernetes, Docker, Terraform, Prometheus, Grafana, LLMs, SLMs, Multimodal Models, Code-Specific Models, Scalability, Reliability, Security, Privacy, Cloud Infrastructure, DevOps

Microsoft Careers

Microsoft

US · 91 days ago · $142,800–$274,800
Kubernetes, Python, C, C++, Java, JavaScript, Terraform, AWS, Azure, PostgreSQL, CI/CD, Prometheus, Grafana, Docker, RDMA, InfiniBand, NCCL, CUDA, AKS, Dynamic Resource Allocation (DRA)

Principal Software Engineer - CoreAI | Microsoft Careers

Microsoft

Redmond, WA · 14 days ago · $142,800–$274,800
Azure, Python, SQL, Kubernetes, Docker, CI/CD, Terraform, PostgreSQL, Snowflake, Apache Spark, Data Governance, Machine Learning, AI Tools, Self-service Analytics, Cross-functional Collaboration, Cloud Services, Data Lineage, Security Best Practices