Principal Machine Learning Engineer | Microsoft Careers

Microsoft

Hybrid Actively hiring
US Posted 48 days ago $139,900$274,800 / year

At a glance

AI generated

TL;DR

As a Principal Machine Learning Engineer at Microsoft’s MAI organization, you will join a dynamic team focused on data labeling and classification for multi-modal Copilot data. Your role involves prototyping and operationalizing complex classification flows on production logs, ensuring secure and compliant data-labeling pipelines are in place. Day-to-day responsibilities include building evaluation loops, generalizing ML solutions into frameworks, and operating prompted classifiers at scale with robust monitoring and cost management. You will collaborate closely with Data Science, Security, and Platform teams to define schemas and governance, while independently writing efficient code for model pipelines. Ideal candidates have 7+ years of experience in Python or Java/Scala, extensive knowledge in distributed systems, and expertise in ML data pipelines using tools like AML, Promptflow, Langchain, or LangGraph. This role demands a strong background in Responsible AI and working with large language models to address complex business challenges at scale.

Skills

Python Java Scala AML Promptflow Langchain LangGraph Distributed Systems Large Scale Data Processing ML Data Pipelines Responsible AI Large Language Models

What you'll do

  • Build evaluation loops for precision/recall, calibration, drift, and human-in-the-loop processes.
  • Generalize ML solutions into repeatable frameworks for scalable deployment.
  • Operationalize prompted classifiers at scale, including batch and streaming operations.
  • Write efficient, readable, extensible code and model pipelines independently.
  • Conduct thorough reviews of data analysis techniques to ensure accuracy and completeness.
  • Collaborate with Security and Platform teams to define schemas and governance standards.

What we're looking for

  • 7+ years of experience writing production-quality Python, Java, or Scala code.
  • 5+ years of experience in designing and implementing distributed systems for large-scale data processing.
  • 3+ years of experience building ML data pipelines using AML, Promptflow, Langchain, or LangGraph.
  • Proven expertise in responsible AI practices.
  • Experience with prompting, evaluating, and working with large language models.

Market check

Salary context

This $139,900–$274,800 range sits above 42% of similar postings on FindRole.

Peer median band

$161,800$272,100

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$194,000$250,250

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Microsoft

Microsoft Corporation is a global technology leader producing software, hardware, and cloud services including Windows, Office 365, Azure cloud platform, Xbox gaming, and Surface devices. Industry: Software & Cloud Computing

Microsoft currently has 534 open roles on FindRole.

Listed pay typically runs $119,800–$234,700 across 488 roles with salary data.

Most-posted roles

View all roles at Microsoft

More like this

Similar roles

Machine Learning Software Engineer | Microsoft Careers

Microsoft

US 48 days ago $139,900$274,800
Python JavaScript C++ Java Kubernetes AWS Azure Docker CI/CD PostgreSQL MongoDB Redis Apache Spark TensorFlow PyTorch Prometheus Grafana Git Jenkins Responsible AI Scikit-learn
Hybrid

Principal Machine Learning Engineer

Intuit

Mountain View, CA 45 days ago $254,500$344,000
Python TensorFlow PyTorch Java Scala Docker Kubernetes AWS CI/CD MLOps PostgreSQL Redis Git Jenkins Prometheus Grafana

Principal Machine Learning Engineer

PayPal

San Jose, CA 74 days ago $242,000$359,150
Python TensorFlow PyTorch Spark BigQuery Airflow dbt Kubernetes AWS Google Cloud CI/CD Docker Prometheus Grafana Redis PostgreSQL MongoDB GraphQL REST_API Swagger
Hybrid

Principal Machine Learning Engineer

Zillow

Remote (Remote-Usa, US) 17 days ago $204,400$326,600
Python LangGraph LangChain AgentsSDK AutoGen Spark Databricks Airflow Temporal AWS CI/CD LLM-based systems Vector stores Observability Elasticsearch Kubernetes
Remote

Principal Machine Learning Engineer

Cisco

Remote (San Jose, CA) 4 days ago $291,500$369,100
Python PyTorch TensorFlow NLP Log Analytics Anomaly Detection Multi-Modal AI Modeling Distributed Training MLOps CI/CD Prometheus Grafana Kubernetes AWS Azure
Remote