Principal Software Engineer

Microsoft

Actively hiring
US Posted 66 days ago $139,900–$274,800 / year

At a glance

AI generated

TL;DR

As a Principal Software Engineer on the supercomputing infrastructure team, you will architect and develop high-volume, low-latency event pipelines that provide real-time insight into job interruptions and reliability. Day to day, you will analyze existing pipelines for critical events, work with data scientists and domain experts to improve key metrics such as Mean Time to Interrupt, and partner across teams to design end-to-end solutions for managing core infrastructure technologies such as datacenter hardware and power systems. You will drive engineering excellence by addressing strategic customer issues and leading initiatives, including continuous learning programs, that minimize future impact. The role requires expertise in languages like C++, Java, or Python, along with extensive experience in AI/HPC systems and cloud infrastructure operations.

Skills

Python, Java, JavaScript, C, C++, Kubernetes, Docker, CI/CD, PostgreSQL, AWS, Azure, Google Cloud, HPC, AI, Telemetry, Power Management, Cooling Systems, Data Center Operations

What you'll do

  • Design and develop high-volume, low-latency event pipelines for real-time insights into job interruptions.
  • Analyze existing event pipelines to assess fidelity, granularity, and latency of critical events.
  • Improve key metrics by enabling data scientists to use telemetry for issue identification and resolution.
  • Partner with cross-functional teams to design and deploy end-to-end solutions for managing core infrastructure.
  • Drive engineering excellence by addressing issues from strategic customers and enhancing product features.
  • Lead complex incident resolutions and champion initiatives to minimize future customer impact.

What we're looking for

  • Bachelor's degree in Computer Science or a related field, with 6+ years of coding experience in C, C++, Java, or Python.
  • Ability to architect and develop high-volume, low-latency event pipelines for critical insights into job interruptions and reliability.
  • Ability to analyze existing event pipelines to improve key metrics such as Mean Time to Interrupt and Mean Time to Resolve.
  • Ability to partner with cross-functional teams to design and deploy end-to-end solutions for managing core infrastructure technologies.
  • Ability to drive engineering excellence by resolving complex incidents, conducting root cause analyses, and minimizing future customer impact.

Market check

Salary context

This $139,900–$274,800 range sits above 54% of similar postings on FindRole.

Peer median band

$143,000–$245,250

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$165,000–$214,500

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Microsoft

Microsoft Corporation is a global technology leader producing software, hardware, and cloud services including Windows, Office 365, Azure cloud platform, Xbox gaming, and Surface devices.

Industry: Software & Cloud Computing

Microsoft currently has 445 open roles on FindRole.

Listed pay typically runs $119,800–$234,700 across 415 roles with salary data.

More like this

Similar roles

Principal Software Engineer

Cisco

Remote (USA - San Jose, US) 87 days ago $231,400–$331,800
Python, C++, ASIC development, Networking function implementation, CI/CD, PostgreSQL, Kubernetes, AWS, Docker, Prometheus, Grafana, P4 programming, SDK development, Linux operating system, Git, Jira, Confluence
Remote

Principal Software Engineer

The Walt Disney Company

Remote (USA - CA - 2450 Broadway, US) 52 days ago $184,300–$247,100
Python, Java, Django, Spring Boot, AWS, Kinesis, DynamoDB, S3, SNS, SQS, MySQL, Postgres, Kafka, CI/CD, Agile, ML/AI
Remote

Principal Software Engineer

Intuit

New York, New York, US 44 days ago $261,000–$353,000
Python, Java, JavaScript, React, Node.js, Docker, Kubernetes, AWS, Azure, CI/CD, Git, PostgreSQL, MongoDB, Agile, Scrum

Principal Software Engineer

Oracle

US 44 days ago $96,800–$223,400
Java, Python, Linux, Docker, Kubernetes, Terraform, CI/CD, Prometheus, Grafana, PostgreSQL, AWS, Azure, Oracle Cloud Infrastructure, BMCs, NICs, SmartNICs, ILOMs, GPUs, Microservices, Observability, High Availability, Security, Networking, Compute, Distributed Systems, Firmware Development, Testing

Principal Software Engineer

Intuit

Mountain View, California, US 44 days ago $261,500–$353,500
Python, Java, JavaScript, Docker, Kubernetes, AWS, CI/CD, PostgreSQL, MongoDB, Redis, Git, Jenkins, Swagger, RESTful APIs

Principal Software Engineer

The Walt Disney Company

Remote (USA - FL - 215 Celebration Place, US) 45 days ago
AWS, Azure, Git, CI/CD, Kubernetes, DevOps, AppDynamics, Splunk, Jira, ServiceNow, Confluence, TOGAF, Snowflake, Agile, SAFe
Remote