Software Engineer II | Microsoft Careers

Microsoft

Actively hiring
US Posted 102 days ago $100,600–$199,000 / year

At a glance

AI generated

TL;DR

As a Supercomputing Software Engineer on the AI Customer Experience (AICE) team within Microsoft Azure’s High Performance Computing & AI Engineering division, you will play a pivotal role in managing and enhancing the performance of flagship supercomputers used by leading AI customers. Your responsibilities include designing and developing scalable data pipelines to monitor and optimize operations across large-scale infrastructure, ensuring rapid response to critical incidents that impact customer capacity. You will also contribute to improving key metrics such as job mean time to interrupt and node availability, while implementing systemic solutions to complex issues affecting supercomputer performance. Additionally, you will review incident postmortems, enhance troubleshooting guides, and proactively seek new knowledge to improve observability and reliability at scale. The role requires expertise in programming languages like C++, Java, or Python, as well as a deep understanding of high-performance computing environments and the ability to work in a metrics-driven culture focused on customer satisfaction.

Skills

Python, C, C++, Java, JavaScript, Azure, Kubernetes, Docker, CI/CD, Prometheus, Grafana, PostgreSQL, Redis, Git, Jenkins, Ansible, Terraform, Linux, Nginx, Flask, Django

What you'll do

  • Design and develop monitoring capabilities for large-scale supercomputer fleets.
  • Create end-to-end data pipelines to process telemetry and logs for actionable alerts.
  • Implement systemic solutions to mitigate complex issues impacting supercomputers.
  • Write incident postmortems and drive changes to reduce future incidents.
  • Independently improve troubleshooting guides, tests, and telemetry for observability.

What we're looking for

  • 4+ years of technical engineering experience coding in C, C++, C#, Java, JavaScript, or Python.
  • Hands-on experience managing operations and troubleshooting complex issues on supercomputers.
  • Experience developing end-to-end data pipelines that process large volumes of telemetry and log files into actionable alerts.
  • Ability to independently improve troubleshooting guides, wikis, and tests, and to add comprehensive observability and monitoring capabilities.
  • Initiative to seek out new trends, technical solutions, and patterns that enhance the performance and reliability of supercomputers.

Market check

Salary context

This $100,600–$199,000 range sits above 36% of similar postings on FindRole.

Peer median band

$119,800–$214,950

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$142,450–$196,750

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Microsoft

Microsoft Corporation is a global technology leader producing software, hardware, and cloud services including Windows, Office 365, Azure cloud platform, Xbox gaming, and Surface devices.

Industry: Software & Cloud Computing

Microsoft currently has 534 open roles on FindRole.

Listed pay typically runs $119,800–$234,700 across 488 roles with salary data.

More like this

Similar roles

Software Engineer II | Microsoft Careers

Microsoft

US 144 days ago $100,600–$199,000
Azure Python C C++ Java JavaScript CUDA InfiniBand RoCE CI/CD Docker Kubernetes Terraform PostgreSQL Prometheus Grafana

Software Engineer II | Microsoft Careers

Microsoft

US 59 days ago $100,600–$199,000
Python C++ Rust Docker Kubernetes Azure CI/CD Prometheus Grafana PostgreSQL Redis Git Linux High Performance Computing Messaging Protocols Networking Hardware Performance Analysis Tools Profiling Tools REST APIs JSON YAML

Software Engineer II | Microsoft Careers

Microsoft

Redmond, WA 3 days ago $102,100–$202,200
Python JavaScript C# ReactJS PowerShell CI/CD Docker Kubernetes AWS Azure Git Terraform PostgreSQL MSSQL Linux Windows OAuth OpenID Connect SAML JSON REST API Swagger/OpenAPI

Software Engineer II | Microsoft Careers

Microsoft

WA 83 days ago $100,600–$199,000
Python Java JavaScript C# Azure Databricks CI/CD Kubernetes Terraform AWS PostgreSQL SQL Git GitHub Jenkins Prometheus Grafana Docker Linux REST JSON APIs

Software Engineer II | Microsoft Careers

Microsoft

Washington 123 days ago $100,600–$199,000
Azure OpenAI SQL NoSQL CosmosDB PostgreSQL MongoDB C C++ Python Kusto SemanticKernel LangChain CI/CD TDD

Software Engineer II | Microsoft Careers

Microsoft

US 96 days ago $100,600–$199,000
PostgreSQL Linux Docker Kubernetes Azure CI/CD Python Go C SQL Terraform Git Jenkins Prometheus Grafana Redis MongoDB Hadoop Spark