Senior Software Engineer - AI Frameworks | Microsoft Careers

Microsoft

Quick summary

Work type: On-site
Location: Redmond, WA
Salary: $119,800–$234,700 / yr
Posted: 4 days ago
Closes: Nov 28, 2026

Market check

Salary context

Competitive pay

How this pay compares to similar roles

Similar $193k

This role $177k

$106k most similar roles pay here $248k

This role pays more than 52% of similar roles. Most pay $166,100–$220,250 — the shaded band above. At the midpoint, this role pays about $177k versus about $193k for comparable roles.

Based on 240 similar postings.

Employer

About Microsoft

Microsoft Corporation is a global technology leader producing software, hardware, and cloud services including Windows, Office 365, Azure cloud platform, Xbox gaming, and Surface devices. Industry: Software & Cloud Computing

Microsoft currently has 728 open roles on FindRole.

Listed pay typically runs $119,800–$234,700 across 664 roles with salary data.

Most-posted roles

View all roles at Microsoft

At a glance

TL;DR · Senior Software Engineer - AI Frameworks | Microsoft Careers

Apply Now Log in to save

The Senior Software Engineer - AI Frameworks role at Microsoft’s AI Frameworks team involves architecting and implementing efficient tensor computation primitives for custom AI accelerators like MAIA. This position requires developing PyTorch features for model optimization on these accelerators and contributing to high-performance inference stacks such as vLLM and SGLang, focusing on scheduling and KV cache management. The ideal candidate will design and optimize kernels for NPUs and GPUs to enhance LLM performance, collaborating across disciplines to solve complex technical challenges. Proficiency in C++, Python, and GPU programming (CUDA) is essential, along with experience in PyTorch internals and large-scale model serving systems. This role offers a unique opportunity to shape Microsoft’s advanced AI infrastructure at the intersection of hardware and software innovation.

Skills

PyTorch vLLM SGLang CUDA Python C++ GPU NPU TensorFlow Kubernetes Docker CI/CD Prometheus Grafana

What you'll do

Architect and implement efficient tensor computation primitives for custom AI accelerators.
Develop PyTorch features to optimize model execution on custom AI hardware.
Improve scheduling and KV cache management in AI inference stacks like vLLM.
Design high-performance kernels for NPUs and GPUs to accelerate LLM workloads.
Profile and optimize software components for better performance on AI accelerators.

What we're looking for

4+ years of technical engineering experience in C, C++, or Python.
Bachelor's Degree in Computer Science or related field.
Experience with PyTorch internals and custom operators.
Proficiency in developing AI inference stacks like vLLM or SGLang.
Expertise in NPU/GPU kernel development and optimization.
Knowledge of LLM concepts including attention mechanisms, KV caching, and quantization.

Similar roles

Senior Software Engineer - Data Platform, AI Infrastructure | Microsoft Careers

Microsoft

US 4 days ago $119,800–$234,700

Python FastAPI Azure ADLS Gen2 Synapse Azure Data Explorer Airflow Terraform Bicep ARM CI/CD Prometheus Grafana Kubernetes PostgreSQL

Save

Senior Software Engineer, AI Frameworks

Nvidia

Remote (Santa Clara, CA) 68 days ago $152,000–$241,500

Kubernetes Docker CI/CD Python Go C++ PyTorch Ray llm-d Dynamo Helm Kustomize Terraform AWS Grafana Prometheus GitOps OpenAPI Swagger

Remote

Save

Senior Software Engineer - AI Core Engineering

The Walt Disney Company

Remote (Usa - Ca - 1200 Grand Central Ave, US) 101 days ago $141,900–$190,300

Python LLM APIs AWS Bedrock Azure AI Foundry LangChain LangGraph APIs SDKs OpenAI Anthropic Claude Observability Tracing Latency and cost dashboards Drift detection Multi-agent orchestration Synthetic data Enterprise governance Security Compliance Audit Policy enforcement

Remote

Save

| Microsoft Careers

Microsoft

US 143 days ago $119,800–$234,700

PyTorch ONNX vLLM SGLang NVLink PCIe TridentOmniscienTriton CUDA BF16 FP8 KV cache quantization Checkpointing Resharding TP PP Parallelism strategies Distributed training concepts Sharding Allreduce Performance profiling

Save

Senior Software Engineer – Edge AI/GenAI & Multimedia

Qualcomm

San Diego, CA 30 days ago $111,300–$166,900

C C++ PyTorch TensorFlow ONNX Runtime LiteRT LangChain LlamaIndex Android Tizen Linux OpenGL ES GLSL Docker OCI Ubuntu CI/CD Multimedia AI GenAI Java JNI AIDL HIDL Binder

Save

Senior Software Engineer (AI Platform)

Smartly

Helsinki, Finland 50 days ago

Python TypeScript PostgreSQL Node.js Docker Kubernetes React AWS GCP CI/CD MLOps PyTorch TensorFlow MLflow Kubeflow

Hybrid

Save