Senior Software Engineer, AI Infrastructure (Scheduler)

Microsoft

Quick summary

Work type: On-site
Location: (not listed)
Salary: $119,800–$234,700 / yr
Posted: 82 days ago

Market check

Salary context

Below market

How this pay compares to similar roles

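[Chart: pay for similar roles on a scale from $104k to $269k, with a shaded band labeled "most similar roles pay here"; markers at $177k (this role) and $202k (similar roles).]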

This role pays less than 68% of similar roles. Most similar roles pay $167,550–$235,750, the shaded band above. At the midpoint, this role pays about $177k, versus about $202k for comparable roles.

Based on 240 similar postings.

Employer

About Microsoft

Microsoft Corporation is a global technology leader producing software, hardware, and cloud services including Windows, Office 365, Azure cloud platform, Xbox gaming, and Surface devices.

Industry: Software & Cloud Computing

Microsoft currently has 622 open roles on FindRole.

Listed pay typically runs $119,800–$234,700 across 571 roles with salary data.


At a glance

TL;DR · Senior Software Engineer, AI Infrastructure (Scheduler)

The Senior Software Engineer, AI Infrastructure (Scheduler) role in the Azure AI Platform organization involves designing and developing distributed services that manage large-scale AI training and inference, with a focus on system stability and efficiency. The position requires expertise in C# and .NET, plus experience with Kubernetes or Service Fabric for hosting control plane services. The ideal candidate has a deep understanding of machine learning concepts and collaborates closely with internal teams to build robust solutions. Responsibilities include operational support, technical leadership within the team, and high service reliability through rigorous engineering practices. Candidates should have advanced knowledge of AI infrastructure and large-scale distributed systems, along with experience running global multi-tenant services. The role is central to managing Azure’s GPU and NPU capacity across regions without compromising security or performance.

What you'll do

  • Design and develop core AI Infrastructure distributed services for large-scale AI training and inference.
  • Enhance control plane services to ensure high stability, efficiency, low latency, and tight cloud security.
  • Provide operational support and act as DRI (on-call) for the service.
  • Foster a deep understanding of machine learning concepts and use cases among team members.
  • Develop and maintain systems using rigorous engineering practices and data-driven problem solving.
  • Collaborate closely with internal teams to build better solutions for AI Platform partner services.

What we're looking for

  • Bachelor's Degree in Computer Science or related field with 4+ years of technical experience.
  • Advanced knowledge of C# and .NET, including proficiency in OOP and design patterns.
  • 3+ years of hands-on experience with large-scale distributed systems and cloud services.
  • Expertise in managing complex codebases and implementing rigorous unit testing practices.
  • Demonstrated ability to build high-availability global services and manage critical control plane services.
  • Strong background in AI infrastructure, workload management, and technical leadership.

More like this

Similar roles

Senior Software Engineer, CoreAI Workload Engines

Microsoft

81 days ago · $119,800–$234,700
Python Kubernetes PyTorch CUDA Prometheus Grafana CI/CD Docker PostgreSQL Redis OpenAI Azure NVIDIA GPUs RDMA InfiniBand RoCE NCCL TensorFlow Hadoop Apache Spark GitLab Jenkins

Senior Software Engineer, Responsible AI

Microsoft

64 days ago · $119,800–$234,700
Azure Kubernetes Docker Python C# JavaScript SQL CI/CD Terraform Prometheus Grafana Git GitHub DevOps REST Swagger OpenAPI PostgreSQL Redis MongoDB GraphQL

Senior Software Engineer, AI Platform

Smartly

Helsinki, Finland · 72 days ago
Python TypeScript PostgreSQL Node.js Docker Kubernetes React AWS GCP CI/CD MLOps PyTorch TensorFlow MLflow Kubeflow
Hybrid

Senior Software Engineer, AI Core Engineering

The Walt Disney Company

Remote · 123 days ago · $141,900–$190,300
Python LLM APIs AWS Bedrock Azure AI Foundry LangChain LangGraph APIs SDKs OpenAI Anthropic Claude Observability Tracing Latency and cost dashboards Drift detection Multi-agent orchestration Synthetic data Enterprise governance Security Compliance Audit Policy enforcement
Remote

Senior Software Engineer, AI Platform Team

Coinbase

Remote · 2 days ago · $186,065–$218,900
Python JavaScript AWS Kubernetes Docker CI/CD PostgreSQL Prometheus Grafana Terraform LLM AI FinOps MCP Vector Markdown GraphQL Chatbots Low-code workflows Traditional ML Fine-tuning Prompting
Remote