Senior Software Engineer - AI Frameworks | Microsoft Careers

Microsoft

Quick summary

Work type
On-site
Location
Redmond, WA
Salary
$119,800–$234,700 / yr
Posted
4 days ago
Closes
Nov 28, 2026

Market check

Salary context

Competitive pay

How this pay compares to similar roles

[Chart: this role $177k vs. similar roles $193k at the midpoint, on a $106k–$248k scale; the shaded band marks where most similar roles pay.]

This role pays more than 52% of similar roles. Most pay $166,100–$220,250 — the shaded band above. At the midpoint, this role pays about $177k versus about $193k for comparable roles.

Based on 240 similar postings.

Employer

About Microsoft

Microsoft Corporation is a global technology leader producing software, hardware, and cloud services including Windows, Office 365, Azure cloud platform, Xbox gaming, and Surface devices. Industry: Software & Cloud Computing

Microsoft currently has 728 open roles on FindRole.

Listed pay typically runs $119,800–$234,700 across 664 roles with salary data.

At a glance

TL;DR · Senior Software Engineer - AI Frameworks

The Senior Software Engineer - AI Frameworks role on Microsoft’s AI Frameworks team involves architecting and implementing efficient tensor computation primitives for custom AI accelerators such as MAIA. The work includes developing PyTorch features that optimize model execution on these accelerators and contributing to high-performance inference stacks such as vLLM and SGLang, with a focus on scheduling and KV cache management. The ideal candidate will design and optimize kernels for NPUs and GPUs to improve LLM performance and will collaborate across disciplines on complex technical problems. Proficiency in C++, Python, and GPU programming (CUDA) is essential, along with experience in PyTorch internals and large-scale model serving systems. The role offers an opportunity to shape Microsoft’s AI infrastructure at the intersection of hardware and software.

What you'll do

  • Architect and implement efficient tensor computation primitives for custom AI accelerators.
  • Develop PyTorch features to optimize model execution on custom AI hardware.
  • Improve scheduling and KV cache management in AI inference stacks like vLLM.
  • Design high-performance kernels for NPUs and GPUs to accelerate LLM workloads.
  • Profile and optimize software components for better performance on AI accelerators.

What we're looking for

  • 4+ years of technical engineering experience in C, C++, or Python.
  • Bachelor's Degree in Computer Science or related field.
  • Experience with PyTorch internals and custom operators.
  • Proficiency in developing AI inference stacks like vLLM or SGLang.
  • Expertise in NPU/GPU kernel development and optimization.
  • Knowledge of LLM concepts including attention mechanisms, KV caching, and quantization.

More like this

Similar roles

Senior Software Engineer, AI Frameworks

Nvidia

Remote (Santa Clara, CA) 68 days ago $152,000–$241,500
Kubernetes Docker CI/CD Python Go C++ PyTorch Ray llm-d Dynamo Helm Kustomize Terraform AWS Grafana Prometheus GitOps OpenAPI Swagger

Senior Software Engineer - AI Core Engineering

The Walt Disney Company

Remote (USA - CA - 1200 Grand Central Ave, US) 101 days ago $141,900–$190,300
Python LLM APIs AWS Bedrock Azure AI Foundry LangChain LangGraph APIs SDKs OpenAI Anthropic Claude Observability Tracing Latency and cost dashboards Drift detection Multi-agent orchestration Synthetic data Enterprise governance Security Compliance Audit Policy enforcement

| Microsoft Careers

Microsoft

US 143 days ago $119,800–$234,700
PyTorch ONNX vLLM SGLang NVLink PCIe TridentOmniscienTriton CUDA BF16 FP8 KV cache quantization Checkpointing Resharding TP PP Parallelism strategies Distributed training concepts Sharding Allreduce Performance profiling