Senior Data Scientist | Microsoft Careers

Microsoft

Hybrid

Quick summary

Work type: Hybrid
Location: Redmond, WA; San Francisco, CA; New York, NY
Salary: $119,800–$234,700 / yr
Posted: 4 days ago
Closes: Dec 7, 2026

Market check

Salary context

Competitive pay

How this pay compares to similar roles

Chart: pay scale from $106k to $248k, with a shaded band marking where most similar roles pay. Similar roles: $181k. This role: $177k.

This role pays more than 56% of similar roles. Most similar roles pay $152,753–$209,187 (the shaded band above). At the midpoint, this role pays about $177k versus about $181k for comparable roles.

Based on 240 similar postings.
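
The midpoint comparison above can be checked with a few lines of arithmetic. This is a minimal sketch, assuming "midpoint" means the simple average of the posted minimum and maximum (the listing does not say how the figure is computed). The function and variable names are illustrative only.

```python
# Assumption: "midpoint" is the simple average of the posted salary bounds.
# All names here are hypothetical; the figures are copied from the listing.

def salary_midpoint(low: float, high: float) -> float:
    """Return the arithmetic midpoint of a salary range."""
    return (low + high) / 2

role_low, role_high = 119_800, 234_700   # posted range for this role
band_low, band_high = 152_753, 209_187   # band where most similar roles pay
similar_midpoint = 181_000               # "about $181k" as quoted, rounded

role_midpoint = salary_midpoint(role_low, role_high)
in_band = band_low <= role_midpoint <= band_high
gap = similar_midpoint - role_midpoint

print(f"Role midpoint: ${role_midpoint:,.0f}")
print(f"Inside typical band: {in_band}")
print(f"Gap to similar roles: ${gap:,.0f}")
```

Under that assumption the role's midpoint is $177,250, which matches the "about $177k" quoted above and sits inside the typical band. The gap to comparable roles is only approximate because the $181k figure is itself rounded.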

Employer

About Microsoft

Microsoft Corporation is a global technology leader producing software, hardware, and cloud services including Windows, Office 365, Azure cloud platform, Xbox gaming, and Surface devices. Industry: Software & Cloud Computing

Microsoft currently has 1568 open roles on FindRole.

Listed pay typically runs $119,800–$234,700 across 1397 roles with salary data.

At a glance

TL;DR · Senior Data Scientist | Microsoft Careers

As a Senior Data Scientist for LLM Evaluation at Copilot, you will join a team focused on improving AI systems so they better serve users across a range of needs. Your primary responsibilities include developing new methods to evaluate the performance of large language models (LLMs), training classifiers, and implementing real-time monitoring frameworks. You will collaborate closely with user researchers and product leaders to create automated evaluation tools that drive continuous improvements in Copilot’s capabilities. Key skills are expertise in the social sciences, machine learning, and natural language analysis, plus experience with LLMs. The role calls for a creative problem solver who can navigate complex scenarios, independently shape project direction, and deliver results efficiently. The work involves large-scale systems serving millions of users and requires proficiency in data mining, prompt engineering, and classifier training to ensure Copilot meets both functional and emotional user needs.

What you'll do

  • Develop new methods to evaluate LLM performance in real-world scenarios.
  • Train classifiers and experiment with data collection techniques for evaluation.
  • Implement methodologies to provide real-time signals on AI system performance.
  • Create comprehensive automated testing systems for diverse use cases.
  • Maintain a user-oriented perspective by understanding and validating approaches through research.
  • Track advances in research and adapt algorithms to drive innovation.

What we're looking for

  • Bachelor's degree in a quantitative field plus 5+ years of data science experience.
  • Expertise in evaluating large language models (LLMs) and prompt engineering.
  • Proficiency in developing automated evaluation frameworks and testing systems.
  • Strong background in machine learning, natural language analysis, and social sciences.
  • Ability to independently solve complex problems and deliver results.
  • Experience in tracking research advances and adapting algorithms for production.

More like this

Similar roles

Senior Data Scientist | Microsoft Careers

Microsoft

US · 116 days ago · $119,800–$234,700
Python, SQL, Azure, DevOps, CI/CD, Git, Responsible AI, TensorFlow, PyTorch, Scikit-learn, Pandas, NumPy, Jupyter Notebook, Power BI, Tableau, Kubernetes, Docker, Prometheus, Grafana

Senior Data Scientist | Microsoft Careers

Microsoft

Washington · 19 days ago · $119,800–$234,700
Python, R, SQL, Causal Inference, Experimental Design, Machine Learning, Econometrics, Statistics, CI/CD, Azure, AWS, Kubernetes, Terraform, PostgreSQL, MLOps

Senior Data Scientist | Microsoft Careers

Microsoft

US · 10 days ago · $119,800–$234,700
Python, SQL, Azure, Responsible AI, DevOps, Version Control, Testing, Agentic AI, Machine Learning, Generative AI, Prompt Engineering, Large Language Models, Data Visualization, Cloud Platforms, CI/CD

Senior Data Scientist | Microsoft Careers

Microsoft

Redmond, WA · 6 days ago · $119,800–$234,700
Python, SQL, Java, C#, Machine Learning, Data Analysis, Statistical Methods, Large Datasets, Search Engines, Online Instrumentation, A/B Testing, Forecast Models
Hybrid