Microsoft Careers

Microsoft

Quick summary

Work type: On-site
Location: Mountain View, CA · Redmond, WA
Salary: $165,600–$296,400 / yr
Posted: 141 days ago
Closes: Jul 14, 2026

Market check

Salary context

Above market

How this pay compares to similar roles

[Chart: salary scale from $125k to $315k, with a shaded band where most similar roles pay. Similar roles: $192k. This role: $231k.]

This role pays more than 84% of similar roles. Most pay $165,000–$219,425 (the shaded band above). At the midpoint, this role pays about $231k versus about $192k for comparable roles.

Based on 240 similar postings.

Employer

About Microsoft

Microsoft Corporation is a global technology leader producing software, hardware, and cloud services including Windows, Office 365, the Azure cloud platform, Xbox gaming, and Surface devices.

Industry: Software & Cloud Computing

Microsoft currently has 310 open roles on FindRole.

Listed pay typically runs $119,800–$234,700 across 285 roles with salary data.

At a glance

TL;DR

As a Data Engineering IC6 on the AI model training team, you will design and develop robust data pipelines for ingesting vast amounts of multi-modal training data, including text, audio, images, and video. Your day-to-day responsibilities include owning and maintaining critical data infrastructure such as Spark and Ray, and building scalable storage that handles petabytes of data. You will work closely with pretraining and post-training teams to improve the data recipe through rigorous experimentation, and you will ensure compliance with data governance standards. This role requires expertise in data engineering and software development, and a deep understanding of data security and compliance. Ideal candidates have extensive experience designing complex data systems and strong skills in Python, SQL, and distributed computing frameworks.

What you'll do

  • Design and develop data pipelines for multi-modal training data ingestion.
  • Own and maintain critical data infrastructure such as Spark, Ray, and vector databases.
  • Build infrastructure capable of storing and processing petabytes of data for AI models.
  • Improve data recipes through rigorous, careful experimentation with pretraining and post-training teams.
  • Ensure compliance with data governance, security, and regulatory standards.

What we're looking for

  • Master's degree in a relevant field and 6+ years of experience, or Bachelor's degree and 8+ years of experience in related areas.
  • Experience designing and developing large-scale data pipelines for multi-modal training data ingestion.
  • Experience maintaining critical data infrastructure, including Spark, Ray, and vector databases.
  • Experience building infrastructure to store and process petabytes of data for AI models.
  • Experience with data governance, compliance, and security.

More like this

Similar roles

Data Research Engineer

Microsoft

US · 179 days ago · $119,800–$234,700
Python, Pandas, NumPy, Spark, Ray, Apache Beam, SQL, CI/CD, Git, Jupyter Notebook, TensorFlow, PyTorch, PostgreSQL, MongoDB, Docker, Kubernetes, AWS, Google Cloud Platform, Azure, GitHub
Hybrid

Senior Member of Technical Staff (AI Infrastructure)

Oracle

Austin, TX · 24 days ago · $79,200–$178,100
Python, Java, C++, Docker, Kubernetes, Terraform, AWS, Azure, Oracle Cloud Infrastructure, CI/CD, Git, PostgreSQL, NoSQL, Distributed Systems, Cloud-Native Services, Performance Optimization, Troubleshooting, Automation Tooling, DevOps

Infrastructure Data & Analytics

Microsoft

California · 116 days ago · $142,800–$274,800
Python, SQL, Distributed data processing frameworks, ETL orchestration, Data warehousing, Self-service dashboards, API design, Cloud services, Data quality control, Data governance, Metric standardization, CI/CD, Kubernetes, Terraform, Prometheus, Grafana
Hybrid

Consulting Member of Technical Staff

Oracle

US · 41 days ago · $96,800–$251,600
Java, C++, Python, Perl, Docker, Kubernetes, AWS, Oracle Cloud Infrastructure, CI/CD, PostgreSQL, MySQL, Redis, MongoDB, Terraform, Ansible, Git, Jenkins, Prometheus, Grafana

Consulting Member of Technical Staff

Oracle

Austin, TX · 22 days ago · $96,800–$251,600
Python, Java, Go, Rust, TypeScript, Kubernetes, CI/CD, Docker, Prometheus, Grafana, AWS, Azure, Google Cloud Platform, PostgreSQL, MongoDB, Redis, Git, GitHub, Jenkins, Terraform, ChatGPT, Claude, Copilot, Cursor, Codex, LLM, Prompt engineering, RAG, AgentOps, LLMOps, Model Context Protocol, Secure coding, Dependency management, Data privacy, Incident-aware engineering

Lead Principal Software Engineer

Oracle

Austin, TX · 17 days ago · $96,800–$251,600
Java, Scala, Python, Kubernetes, Terraform, Linux, CI/CD, Docker, Prometheus, Grafana, PostgreSQL, AWS, Azure, Oracle Cloud Infrastructure, Git, Jenkins, Ansible, Chef, JSON, YAML, REST, GraphQL, Spring Boot, Hibernate, DynamoDB, RDS, S3, Lambda, VPC, IAM, ECS, EKS, Vault, Kafka, Zookeeper, Redis, MongoDB, MySQL, Oracle Database