Principal Software Engineer - Performance Tooling | Microsoft Careers

Microsoft

Actively hiring Posted this week Verified listing
US Posted 5 days ago $142,800$274,800 / year

At a glance

AI generated

TL;DR

As a Principal Software Engineer on Microsoft’s AI Frameworks team, you will work across multiple layers of the AI software stack, including abstractions, programming models, compilers, runtimes, libraries, and APIs, to enable large-scale training and inference of AI models. Your daily tasks will involve benchmarking OpenAI and other LLMs for performance on GPUs and Microsoft hardware, debugging and optimizing their performance, and monitoring regressions to ensure continuous improvements in deployment speed and efficiency. You will collaborate closely with researchers and engineers to deliver scalable, production-ready solutions that support major Microsoft products like Office, Windows, Bing, SQL Server, and Dynamics. The role requires expertise in C++ or Python, experience with high-performance applications on CPUs/GPUs, familiarity with DL frameworks such as PyTorch, TensorFlow, or ONNX Runtime, and proficiency in GPU profiling tools.

Skills

Python C++ PyTorch TensorFlow ONNX_Runtime CUDA ROCm Triton GPU_Profiling_Tools Docker CI/CD Linux Windows

What you'll do

  • Benchmark OpenAI and other LLM models for performance on GPUs and Microsoft hardware.
  • Debug and optimize training/inference workloads on CPUs/GPUs for better performance.
  • Monitor performance regressions and drive continuous improvements in deployment efficiency.
  • Enable large-scale AI model deployments with minimal hardware requirements.
  • Collaborate with researchers to deliver scalable, production-ready AI performance enhancements.

What we're looking for

  • Master's Degree in Computer Science or related field with 12+ years experience, or Bachelor's with 15+ years experience in technical engineering.
  • Proficient in coding languages such as C++ or Python for high performance applications and debugging on CPUs/GPUs.
  • Experience working with DNN/LLM inference and familiarity with DL frameworks like PyTorch, TensorFlow, ONNX Runtime, CUDA, ROCm, Triton.
  • Strong background in software engineering principles, computer architecture, GPU architecture, and hardware neural net acceleration.
  • Expertise in end-to-end performance analysis and optimization of state-of-the-art LLMs and HPC applications using GPU profiling tools.
  • Ability to independently lead projects and collaborate effectively with cross-functional teams.

Market check

Salary context

Above market

How this pay compares to similar roles

Similar $196k
This role $209k
$127k most similar roles pay here $291k

This role pays more than 69% of similar roles. Most pay $177,250–$214,500 — the shaded band above. At the midpoint, this role pays about $209k versus about $196k for comparable roles.

Based on 240 similar postings.

Employer

About Microsoft

Microsoft Corporation is a global technology leader producing software, hardware, and cloud services including Windows, Office 365, Azure cloud platform, Xbox gaming, and Surface devices. Industry: Software & Cloud Computing

Microsoft currently has 534 open roles on FindRole.

Listed pay typically runs $119,800–$234,700 across 488 roles with salary data.

Most-posted roles

View all roles at Microsoft

More like this

Similar roles

Principal Software Engineer | Microsoft Careers

Microsoft

Redmond, WA 109 days ago $139,900$274,800
Python Java JavaScript C# AI CI/CD Kubernetes Docker AWS Azure PostgreSQL MongoDB Git Jenkins GitHub Swagger RESTful APIs Microservices Cloud Native DevOps SRE Observability Security
Hybrid

Principal Software Engineer | Microsoft Careers

Microsoft

US 132 days ago $139,900$274,800
Python Java Kubernetes AWS Azure CI/CD MLOps Apache Spark Flink Docker Prometheus Grafana PostgreSQL Redis Scalability High-Availability Multi-Agent Systems Reinforcement Learning
Hybrid

Principal Software Engineer | Microsoft Careers

Microsoft

US 88 days ago $139,900$274,800
C C++ Rust Python JavaScript Java .NET Performance Engineering Large-Scale Software Design Architectural Modernization Legacy Codebase Optimization Performance Tooling Automation AI-Assisted Diagnostics Cross-Team Collaboration Code Reviews
Hybrid

Principal Software Engineer | Microsoft Careers

Microsoft

US 10 days ago $165,600$296,400
Azure Kubernetes Docker CI/CD Apache Spark Kafka PostgreSQL Redis GraphQL Python JavaScript TypeScript React Node.js ML/AI Data pipelines Microservices APIs Schema evolution Telemetry Operational excellence
Hybrid

Principal Software Engineer | Microsoft Careers

Microsoft

US 48 days ago $163,000$296,400
CUDA TensorRT Triton PyTorch Nsight Azure H100 A100 LLM SLM MoE Model Compression Quantization Distillation SVD Low-Rank Methods Continuous Batching KV-Cache Optimizations Routing DLIS Talon
Hybrid