Senior Researcher - Efficient AI | Microsoft Careers

Microsoft

Quick summary

Work type
On-site
Location
Redmond, WA
Salary
$119,800–$234,700 / yr
Posted
3 days ago
Closes
Dec 8, 2026

Market check

Salary context

Competitive pay

How this pay compares to similar roles

Similar $209k
This role $177k
$104k most similar roles pay here $265k

This role pays less than 64% of similar roles. Most pay $175,300–$242,325 — the shaded band above. At the midpoint, this role pays about $177k versus about $209k for comparable roles.

Based on 240 similar postings.

Employer

About Microsoft

Microsoft Corporation is a global technology leader producing software, hardware, and cloud services including Windows, Office 365, Azure cloud platform, Xbox gaming, and Surface devices. Industry: Software & Cloud Computing

Microsoft currently has 1568 open roles on FindRole.

Listed pay typically runs $119,800–$234,700 across 1397 roles with salary data.

Most-posted roles

View all roles at Microsoft

At a glance

TL;DR · Senior Researcher - Efficient AI | Microsoft Careers

As a Senior Applied Research Engineer on Microsoft’s generative AI team, you will work across the full stack to advance efficiency in AI systems, focusing on algorithmic, system-level, and hardware/software co-design techniques. Your day-to-day responsibilities include formulating new approaches for end-to-end AI serving, evaluating endpoint configurations, and optimizing GPU architectures through close collaboration with model, kernel, compiler, and hardware teams. You will build experimental prototypes to validate research ideas, publish findings in academic conferences, and contribute to open-source projects. The role requires a deep understanding of machine learning frameworks like PyTorch and TensorFlow, expertise in GPU programming using CUDA or ROCm, and proficiency in C++ and Python for high-performance systems. This position emphasizes real-world impact through rigorous prototyping, validation, and deployment to enhance productivity across Microsoft 365’s massive user base.

What you'll do

  • Develop and evaluate new algorithmic approaches for AI serving to improve efficiency.
  • Design and experimentally validate endpoint configuration strategies for optimal performance.
  • Collaborate with hardware teams to optimize serving algorithms for accelerator capabilities.
  • Build experimental prototypes and conduct large-scale measurements to drive research ideas.
  • Publish research findings, file patents, and contribute to open-source systems frameworks.
  • Take responsibility for driving research ideas through prototyping, validation, and deployment.

What we're looking for

  • Doctorate in a relevant field or equivalent experience.
  • 3+ years of experience designing and optimizing efficient inference systems.
  • Expertise in machine learning frameworks like PyTorch, TensorFlow, and serving frameworks such as vLLM, Triton Inference Server.
  • Proficiency in GPU programming with CUDA, ROCm, and other GPU optimization tools.
  • Strong knowledge of algorithmic optimization, parallel computing, and request orchestration under strict SLO constraints.
  • Research impact through publications and patents, along with hands-on experience delivering research ideas to production.

More like this

Similar roles

| Microsoft Careers

Microsoft

WA 158 days ago $119,800$234,700
Python C++ CUDA TensorFlow PyTorch Kubernetes Docker AWS Azure CI/CD Git GitHub Jupyter PostgreSQL MESOS ZooKeeper