Research Intern - AI Evaluation and Alignment

Microsoft

Quick summary

Work type: On-site
Location: Redmond, WA
Employment: Intern
Posted: 138 days ago
Closes: Jul 15, 2026
Nearby: 99+ roles within 25 mi

Market check

Salary context

How this pay compares to similar roles

Similar $204k

$152k most similar roles pay here $260k

This listing doesn't post a salary. Most similar roles pay $162,168–$246,150.

Based on 240 similar postings.

Employer

About Microsoft

Microsoft Corporation is a global technology leader producing software, hardware, and cloud services including Windows, Office 365, Azure cloud platform, Xbox gaming, and Surface devices. Industry: Software & Cloud Computing

Microsoft currently has 571 open roles on FindRole.

Listed pay typically runs $119,800–$234,700 across 522 roles with salary data.

Most-posted roles

View all roles at Microsoft

At a glance

TL;DR · Research Intern - AI Evaluation and Alignment

Apply Now Log in to save

Join our dynamic research team as a PhD candidate specializing in machine learning, where you will co-develop cutting-edge projects with supervisors and mentors, focusing on designing and implementing advanced ML approaches using real-world datasets. Your daily tasks include training and fine-tuning large language models (LLMs), developing evaluation frameworks to assess model performance, and presenting your findings effectively. Ideal candidates hold a PhD in Statistics, Computer Science, Physics, or Operations Research and have at least one year of hands-on experience with LLM-related projects such as prompt engineering and rewards modeling. Proficiency in Python, deep learning frameworks like PyTorch and TensorFlow, and software engineering best practices is essential. This role requires strong independent problem-solving skills and the ability to collaborate across disciplines on complex research challenges.

Skills

Python PyTorch TensorFlow git LLM-related_projects reward_models LLM_post-training_and_evaluation LLM-as-a-Judge

What you'll do

Design and implement machine learning approaches using real-world datasets.
Train and fine-tune models for large language tasks like prompt engineering.
Develop evaluation frameworks to assess model robustness and generalization.
Present research findings on machine learning projects and methodologies.
Code in Python and utilize deep learning frameworks like PyTorch or TensorFlow.

What we're looking for

PhD student in Statistics, Computer Science, Physics, Operations Research, or related field.
At least 1 year of hands-on experience with large language model (LLM) projects.
Strong Python coding skills and proficiency with deep learning frameworks like PyTorch and TensorFlow.
Experience in developing reward models for LLMs or using LLMs as judges.
Demonstrated research experience through publications or significant projects.
Ability to work independently and collaborate effectively across disciplines.

Similar roles

Research Intern - AI Frontiers

Microsoft

Redmond, WA 28 days ago

Python TensorFlow PyTorch DeepLearning LargeLanguageModels ReinforcementLearning CI/CD Git JupyterNotebook GitHub GoogleColab PostgreSQL MongoDB

Save

Research Intern - Self-Improving AI

Microsoft

Cambridge, MA 16 days ago

Python PyTorch huggingface-transformers vLLM deep-learning language-modeling reinforcement-learning

Save

Research Intern - AI Frameworks (Network Systems and Tools)

Microsoft

Redmond, WA 180 days ago

Python C C++ PyTorch CUDA Triton Docker Kubernetes CI/CD Prometheus Grafana PostgreSQL Git Linux AWS Azure Google Cloud Platform

Save

PhD Research Intern, Generative AI - 2026

Nvidia

Santa Clara, CA 8 days ago

Python PyTorch Distributed Training Simulation Video Models VLMs World Models Robotics Autonomous Driving Reinforcement Learning Computer Vision Multimodal Learning

Save

AI and Systems Software Intern, At Scale AI - Fall 2026

Nvidia

Santa Clara, CA 8 days ago

Python Bash Kubernetes Slurm Prometheus Grafana ELK_stack strace gdb perf Linux HPC PCIe NVLink CI/CD

Save

AI/ML Staff Researcher

General Motors (GM)

Mountain View, CA 8 days ago

Python TensorFlow PyTorch Keras Scikit-learn AWS Azure Google Cloud Platform CI/CD Docker Kubernetes Git Jupyter Notebook PostgreSQL MongoDB Responsible AI Large Language Models Generative AI Physics-based AI Scientific Machine Learning

Hybrid

Save