Research Intern - AI Evaluation and Alignment

Microsoft

Quick summary

Work type
On-site
Location
Redmond, WA
Employment
Intern
Posted
138 days ago
Closes
Jul 15, 2026

Market check

Salary context

How this pay compares to similar roles

Similar $204k
$152k most similar roles pay here $260k

This listing doesn't post a salary. Most similar roles pay $162,168–$246,150.

Based on 240 similar postings.

Employer

About Microsoft

Microsoft Corporation is a global technology leader producing software, hardware, and cloud services including Windows, Office 365, Azure cloud platform, Xbox gaming, and Surface devices. Industry: Software & Cloud Computing

Microsoft currently has 571 open roles on FindRole.

Listed pay typically runs $119,800–$234,700 across 522 roles with salary data.

Most-posted roles

View all roles at Microsoft

At a glance

TL;DR · Research Intern - AI Evaluation and Alignment

Join our dynamic research team as a PhD candidate specializing in machine learning, where you will co-develop cutting-edge projects with supervisors and mentors, focusing on designing and implementing advanced ML approaches using real-world datasets. Your daily tasks include training and fine-tuning large language models (LLMs), developing evaluation frameworks to assess model performance, and presenting your findings effectively. Ideal candidates hold a PhD in Statistics, Computer Science, Physics, or Operations Research and have at least one year of hands-on experience with LLM-related projects such as prompt engineering and rewards modeling. Proficiency in Python, deep learning frameworks like PyTorch and TensorFlow, and software engineering best practices is essential. This role requires strong independent problem-solving skills and the ability to collaborate across disciplines on complex research challenges.

What you'll do

  • Design and implement machine learning approaches using real-world datasets.
  • Train and fine-tune models for large language tasks like prompt engineering.
  • Develop evaluation frameworks to assess model robustness and generalization.
  • Present research findings on machine learning projects and methodologies.
  • Code in Python and utilize deep learning frameworks like PyTorch or TensorFlow.

What we're looking for

  • PhD student in Statistics, Computer Science, Physics, Operations Research, or related field.
  • At least 1 year of hands-on experience with large language model (LLM) projects.
  • Strong Python coding skills and proficiency with deep learning frameworks like PyTorch and TensorFlow.
  • Experience in developing reward models for LLMs or using LLMs as judges.
  • Demonstrated research experience through publications or significant projects.
  • Ability to work independently and collaborate effectively across disciplines.

More like this

Similar roles

Research Intern - AI Frontiers

Microsoft

Redmond, WA 28 days ago
Python TensorFlow PyTorch DeepLearning LargeLanguageModels ReinforcementLearning CI/CD Git JupyterNotebook GitHub GoogleColab PostgreSQL MongoDB

PhD Research Intern, Generative AI - 2026

Nvidia

Santa Clara, CA 8 days ago
Python PyTorch Distributed Training Simulation Video Models VLMs World Models Robotics Autonomous Driving Reinforcement Learning Computer Vision Multimodal Learning

AI/ML Staff Researcher

General Motors (GM)

Mountain View, CA 8 days ago
Python TensorFlow PyTorch Keras Scikit-learn AWS Azure Google Cloud Platform CI/CD Docker Kubernetes Git Jupyter Notebook PostgreSQL MongoDB Responsible AI Large Language Models Generative AI Physics-based AI Scientific Machine Learning
Hybrid