Senior Software Engineer - AI Frameworks | Microsoft Careers

Microsoft

Quick summary

Work type: On-site
Location: Redmond, WA
Salary: $119,800–$234,700 / yr
Posted: 14 days ago
Closes: Nov 28, 2026

Market check

Salary context

Competitive pay

How this pay compares to similar roles

[Chart: pay for similar roles ranges from $106k to $248k, with a shaded band where most similar roles pay. Midpoints: this role $177k, similar roles $199k.]

This role pays more than 52% of similar roles. Most pay $176,937–$221,250 — the shaded band above. At the midpoint, this role pays about $177k versus about $199k for comparable roles.

Based on 240 similar postings.

Employer

About Microsoft

Microsoft Corporation is a global technology leader producing software, hardware, and cloud services including Windows, Office 365, Azure cloud platform, Xbox gaming, and Surface devices.

Industry: Software & Cloud Computing

Microsoft currently has 1580 open roles on FindRole.

Listed pay typically runs $119,800–$234,700 across 1408 roles with salary data.

At a glance

TL;DR · Senior Software Engineer - AI Frameworks | Microsoft Careers

The Senior Software Engineer - AI Frameworks role at Microsoft’s AI Frameworks team involves architecting and implementing efficient tensor computation primitives for custom AI accelerators like MAIA. This position requires developing PyTorch features for model optimization on these accelerators and contributing to high-performance inference stacks such as vLLM and SGLang, focusing on scheduling and KV cache management. The ideal candidate will design and optimize kernels for NPUs and GPUs to enhance LLM performance, collaborating across disciplines to solve complex technical challenges. Proficiency in C++, Python, and GPU programming (CUDA) is essential, along with experience in PyTorch internals and large-scale model serving systems. This role offers a unique opportunity to shape Microsoft’s advanced AI infrastructure at the intersection of hardware and software innovation.

What you'll do

  • Architect and implement efficient tensor computation primitives for custom AI accelerators.
  • Develop PyTorch features to optimize model execution on custom AI hardware.
  • Improve scheduling and KV cache management in AI inference stacks like vLLM.
  • Design high-performance kernels for NPUs and GPUs to accelerate LLM workloads.
  • Profile and optimize software components for better performance on AI accelerators.

What we're looking for

  • 4+ years of technical engineering experience in C, C++, or Python.
  • Bachelor's Degree in Computer Science or related field.
  • Experience with PyTorch internals and custom operators.
  • Proficiency in developing AI inference stacks like vLLM or SGLang.
  • Expertise in NPU/GPU kernel development and optimization.
  • Knowledge of LLM concepts including attention mechanisms, KV caching, and quantization.

More like this

Similar roles

| Microsoft Careers

Microsoft

US 14 days ago $119,800–$234,700
Python FastAPI Azure ADLS Gen2 Synapse Azure Data Explorer Airflow Terraform Bicep ARM CI/CD Prometheus Grafana Kubernetes PostgreSQL

Senior Software Engineer - AI Core Engineering

The Walt Disney Company

Remote (USA - CA - 1200 Grand Central Ave, US) 111 days ago $141,900–$190,300
Python LLM APIs AWS Bedrock Azure AI Foundry LangChain LangGraph APIs SDKs OpenAI Anthropic Claude Observability Tracing Latency and cost dashboards Drift detection Multi-agent orchestration Synthetic data Enterprise governance Security Compliance Audit Policy enforcement

| Microsoft Careers

Microsoft

US 153 days ago $119,800–$234,700
PyTorch ONNX vLLM SGLang NVLink PCIe TridentOmniscienTriton CUDA BF16 FP8 KV cache quantization Checkpointing Resharding TP PP Parallelism strategies Distributed training concepts Sharding Allreduce Performance profiling

Senior Software Engineer - AI Platform Team

Coinbase

Remote (Oakland, CA) 4 days ago
Python AWS Kubernetes Docker CI/CD PostgreSQL Prometheus Grafana MVP GraphQL FinOps LLM MicroVMs OAuth VectorDB Markdown Chatbots LowCode TraditionalML FineTuning PromptEngineering

Senior Software Engineer - AI

Oracle

Austin, TX 7 days ago $79,200–$209,500
Java Python REST APIs SQL OCI AWS Azure Agile CI/CD Cloud Technologies AI-assisted software development tools GenAI prompt engineering EDI transactions Docker Kubernetes PostgreSQL Maven Git