Principal Software Engineer - Performance Tooling | Microsoft Careers
Microsoft
At a glance
AI generated
As a Principal Software Engineer on the Artificial Intelligence Cloud Inference team at Microsoft, you will work closely with OpenAI to optimize the performance of large language models (LLMs) across various hardware platforms, including GPUs and custom silicon. Your daily tasks will involve benchmarking LLMs for performance, debugging and optimizing code at multiple levels, from kernels to systems, and developing software tools to enhance insights into performance bottlenecks. You will also collaborate with internal and external partners to ensure fast deployment of models on new hardware and contribute to achieving Azure's capital expenditure goals by reducing the computing fleet footprint. This role requires expertise in high-performance applications, GPU architecture, and deep learning frameworks like PyTorch or TensorFlow, as well as proficiency in CUDA and ROCm for efficient model inference and optimization.
Market check
This $139,900–$274,800 range sits above 62% of similar postings on FindRole.
Peer median band
$140,600–$261,850
Median floor and ceiling across peers.
Typical midpoint (25–75%)
$177,250–$214,625
Middle half of comparable postings.
Based on 240 comparable postings.
* 240 is the maximum number of comparable postings sampled.
Employer
Microsoft Corporation is a global technology leader producing software, hardware, and cloud services including Windows, Office 365, Azure cloud platform, Xbox gaming, and Surface devices.
Industry: Software & Cloud Computing
Microsoft currently has 451 open roles on FindRole.
Listed pay typically runs $119,800–$234,700 across 417 roles with salary data.