Senior Software Engineer - AI Frameworks

Microsoft

Quick summary

Work type
On-site
Location
US
Salary
$119,800–$234,700 / yr
Posted
10 days ago
Closes
Dec 14, 2026

Market check

Salary context

Competitive pay

How this pay compares to similar roles

Similar $201k
This role $177k
$106k most similar roles pay here $250k

This role pays less than 60% of similar roles. Most pay $167,249–$235,750 — the shaded band above. At the midpoint, this role pays about $177k versus about $201k for comparable roles.

Based on 240 similar postings.

Employer

About Microsoft

Microsoft Corporation is a global technology leader producing software, hardware, and cloud services including Windows, Office 365, Azure cloud platform, Xbox gaming, and Surface devices. Industry: Software & Cloud Computing

Microsoft currently has 622 open roles on FindRole.

Listed pay typically runs $119,800–$234,700 across 559 roles with salary data.

Most-posted roles

View all roles at Microsoft

At a glance

TL;DR · Senior Software Engineer - AI Frameworks

The Senior Software Engineer role at Microsoft's AI Frameworks team involves architecting and implementing efficient tensor computation primitives for custom AI accelerators while extending PyTorch features to optimize model execution on these devices. Day-to-day responsibilities include developing high-performance kernels for NPUs (MAIA) and GPUs, contributing to inference stacks like vLLM and SGLang, and collaborating with cross-functional teams to define requirements and deliver innovative solutions. The ideal candidate will have expertise in C++, Python, and GPU kernel development, along with experience in PyTorch internals and large-scale model serving systems. This role offers the opportunity to work on cutting-edge AI infrastructure that powers Microsoft’s advanced custom silicon and models, addressing complex technical challenges at scale.

What you'll do

  • Architect and implement efficient tensor computation primitives for custom AI accelerators.
  • Develop PyTorch features to optimize model execution on custom AI hardware.
  • Improve scheduling and KV cache management in AI inference stacks like vLLM.
  • Design high-performance kernels for NPUs and GPUs to accelerate LLM workloads.
  • Profile and optimize software components for better performance and efficiency.
  • Collaborate with cross-functional teams to define technical requirements and solutions.

What we're looking for

  • 4+ years of technical engineering experience in C, C++, or Python.
  • Bachelor's Degree in Computer Science or related field.
  • Experience with PyTorch internals and custom operators.
  • Proficiency in developing AI inference stacks like vLLM or SGLang.
  • Expertise in NPU/GPU kernel development and optimization (CUDA, Triton).
  • Knowledge of LLM concepts including attention mechanisms and KV caching.

More like this

Similar roles

Senior Software Engineer, AI Core Engineering

The Walt Disney Company

Remote 123 days ago $141,900$190,300
Python LLM APIs AWS Bedrock Azure AI Foundry LangChain LangGraph APIs SDKs OpenAI Anthropic Claude Observability Tracing Latency and cost dashboards Drift detection Multi-agent orchestration Synthetic data Enterprise governance Security Compliance Audit Policy enforcement
Remote

Senior Software Engineer, AI

Blackline

Pleasanton, CA 3 days ago
Python Java C++ TensorFlow PyTorch Kubernetes Docker AWS Azure CI/CD Git SQL NoSQL Scikit-learn Pandas NumPy Jupyter Linux REST APIs
Hybrid

Senior AI Software Engineer

T. Rowe Price

New York, NY +1 16 days ago $121,000$206,000
Python Java JavaScript AWS Azure React Angular Docker Kubernetes CI/CD Prometheus Grafana PostgreSQL Redis Git Jenkins
Hybrid

Senior Software Engineer, AI Platform Team

Coinbase

Remote 4 days ago
Python JavaScript AWS Kubernetes Docker CI/CD PostgreSQL Prometheus Grafana Terraform LLM AI FinOps MCP Vector Markdown GraphQL Chatbots Low-code workflows Traditional ML Fine-tuning Prompting
Remote

Senior Software Engineer, AI Platforms

JLL (Jones Lang LaSalle)

Boston, MA +3 26 days ago $120,000$200,000
Python Node.js React Azure AWS GCP Docker Kubernetes CI/CD Git GitHub SQL Server CosmosDB Vector databases RAG systems Prompt engineering LLM integration Automated testing frameworks
Hybrid