| Microsoft Careers

Microsoft

Quick summary

Work type: On-site
Location: Redmond, WA
Salary: $102,100–$202,200 / yr
Posted: 42 days ago

Market check

Salary context

Below market

How this pay compares to similar roles

Chart: pay scale from $86k to $251k, with this role marked at $152k, similar roles at $181k, and a shaded band showing where most similar roles pay.

This role pays less than 76% of similar roles. Most pay $155,000–$207,350 (the shaded band above). At the midpoint, this role pays about $152k versus about $181k for comparable roles.
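The midpoint figures quoted above can be reproduced with simple arithmetic. The sketch below is illustrative only: it assumes the "about $181k" figure is the midpoint of the $155,000–$207,350 band, which matches the numbers but is not confirmed by the listing, and the function name `midpoint` is hypothetical.

```python
def midpoint(low: float, high: float) -> float:
    """Return the midpoint of a salary range."""
    return (low + high) / 2


# Figures quoted in the listing.
role_mid = midpoint(102_100, 202_200)     # this role's posted range
# Assumption: the comparable-role figure is the midpoint of the band
# where most similar roles pay. The listing does not state its method.
similar_mid = midpoint(155_000, 207_350)

gap = similar_mid - role_mid
gap_pct = gap / similar_mid * 100

print(f"This role:     ${role_mid:,.0f}")
print(f"Similar roles: ${similar_mid:,.0f}")
print(f"Gap:           ${gap:,.0f} ({gap_pct:.1f}% lower)")
```

Under that assumption, the two midpoints come out to $152,150 and $181,175, consistent with the rounded $152k and $181k shown in the listing.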

Based on 238 similar postings.

Employer

About Microsoft

Microsoft Corporation is a global technology company that produces software, hardware, and cloud services, including Windows, Office 365, the Azure cloud platform, Xbox gaming, and Surface devices.

Industry: Software & Cloud Computing

Microsoft currently has 1103 open roles on FindRole.

Listed pay typically runs $119,800–$234,700 across 985 roles with salary data.

At a glance

TL;DR

As a Software Engineer 2 on the AI Core Inferencing team at Microsoft Azure's AI Inference platform, you will play a pivotal role in designing and implementing core infrastructure to serve cutting-edge large language models (LLMs) and generative AI models from OpenAI and other providers. Your responsibilities include optimizing end-to-end inference performance, developing efficient load scheduling strategies, scaling the platform for high demand, and delivering critical capabilities for new Gen AI models like GPT5 and Sora. You will collaborate with both internal teams and external partners to ensure seamless integration and high availability of these systems. The ideal candidate has a strong background in software engineering principles, experience with distributed computing, and proficiency in languages such as C++, Java, or Go. This role involves working on large-scale, real-time online services that require low latency and high throughput, making it an exciting opportunity to influence the future of AI at scale.

What you'll do

  • Design and implement core inference infrastructure for serving advanced AI models in production.
  • Identify and drive improvements to end-to-end performance and efficiency of state-of-the-art LLMs and GenAI models.
  • Develop efficient load scheduling and balancing strategies for high-throughput, low-latency environments.
  • Scale the platform to support growing inferencing demand while maintaining high availability.
  • Deliver critical capabilities required to serve cutting-edge Gen AI models quickly.
  • Collaborate with internal and external partners to drive new features and platform capabilities.

What we're looking for

  • 2+ years of technical engineering experience in software development using C, C++, C#, Java, or Golang.
  • Experience with high-scale, reliable online systems and real-time services requiring low latency and high throughput.
  • Knowledge of network architecture, including the HTTP and TCP protocols, authentication, and session management.
  • Proficiency with open-source software (OSS), Docker, and Kubernetes, plus experience with C++ or Golang.
  • Ability to independently lead projects and collaborate effectively with internal and external partners.
  • Strong foundation in software engineering principles, distributed computing, and system architecture.

More like this

Similar roles

Microsoft

Redmond, WA +2 58 days ago $142,800–$274,800
Azure Kubernetes Docker CI/CD Python PostgreSQL Terraform Prometheus Grafana Git Jira Swagger RESTful APIs JSON YAML DevOps Scrum Agile
Hybrid

Microsoft

US 57 days ago $102,100–$202,200
Intune Microsoft Azure Windows 11 iOS Android SCIM Terraform Docker CI/CD Kubernetes PostgreSQL Python Prometheus Grafana AI Agentic AI

Microsoft

WA 76 days ago $119,800–$234,700
Python TypeScript Golang Java C# Scala Rust React Next.js AI/ML systems C#/Java Model pretraining Post training Evaluation Inference CI/CD

Microsoft

Redmond, WA 51 days ago
Python TensorFlow PyTorch DeepLearning ComputerVision SelfSupervisedLearning MixtureOfExperts DenseVisionProblems CVPR NeurIPS ICML ICCV ECCV AAAI IJCAI 3DV IEEETransactions ACMTransactions IJCV

Microsoft

US 28 days ago $130,900–$251,900
AI Cloud Security CI/CD Data Protection Model Protection Runtime Monitoring Agent Security Market Trends Competitive Intelligence Vendor Analysis Technical Communication Cybersecurity AI Governance Risk Management Product Marketing Strategic Market Analysis

Microsoft

US 35 days ago $119,800–$234,700
Azure Kubernetes AWS Terraform Docker CI/CD PostgreSQL Python Go Prometheus Grafana Git Jenkins Ansible Linux Windows_Server VMware Cisco_Networking Nutanix OpenStack Scrum Agile