Research Scientist, AI Evaluation Science

Apple Inc

Quick summary

Work type: On-site
Location: Seattle, WA
Salary: $201,300–$302,200 / yr
Posted: 97 days ago
Nearby: 99+ roles within 25 mi

Market check

Salary context

Above market

How this pay compares to similar roles

Similar $183k

This role $252k

$106k most similar roles pay here $323k

This role pays more than 83% of similar roles. Most pay $126,800–$240,152 — the shaded band above. At the midpoint, this role pays about $252k versus about $183k for comparable roles.

Based on 240 similar postings.

Employer

About Apple Inc

Apple Inc. is a multinational technology company known for designing and manufacturing consumer electronics, software, and online services, including the iPhone, Mac, iPad, and App Store. Industry: Consumer Electronics & Software

Apple Inc currently has 1723 open roles on FindRole.

Listed pay typically runs $162,500–$272,100 across 1398 roles with salary data.

Most-posted roles

View all roles at Apple Inc

At a glance

TL;DR · Research Scientist, AI Evaluation Science

Apply Now Log in to save

Join Apple's Services Engineering team as a Research Scientist in AI Evaluation Science, an interdisciplinary role that bridges ML researchers and measurement scientists to develop rigorous evaluation methodologies for large language models and human-AI interactions. You will conduct original research on areas like preference learning, reward modeling, and calibration theory, publish findings at top-tier venues, and collaborate with platform engineers to productionize your methods into SDKs and APIs used across Apple. Ideal candidates have a Ph.D., strong publication records in evaluation-adjacent ML areas, and expertise in implementing complex methods from recent papers. Proficiency in modern ML frameworks like PyTorch or TensorFlow is preferred, along with experience in theoretical foundations of evaluation such as measurement theory and validity frameworks.

Skills

Python PyTorch JAX TensorFlow NeurIPS ICML ICLR ACL EMNLP RLHF DPO Proper scoring rules IRT Active learning Annotation quality Preference elicitation Calibration theory Statistical reliability Decision theory

What you'll do

Advance evaluation methodology through original research in areas like preference learning and reward modeling.
Publish findings at top-tier venues to contribute to the recognition of evaluation science.
Translate complex methods into production-ready tools by partnering with platform engineers.
Collaborate with measurement scientists to integrate psychometric methods into evaluation systems.
Define the team's research agenda by identifying high-leverage open problems in evaluation science.

What we're looking for

Ph.D. in Computer Science, Machine Learning, or related field with focus on evaluation-adjacent areas.
Strong publication record at top-tier conferences like NeurIPS, ICML, ICLR, ACL, EMNLP.
Deep expertise in preference learning, reward modeling, calibration theory, or human-AI interaction methodology.
Ability to implement complex methods from recent papers and run large-scale experiments.
Track record of translating research into practical systems used by others.
Excellent written and verbal communication skills for diverse audiences.

Similar roles

Applied AI Scientist

Apple Inc

Cupertino, CA 50 days ago $181,100–$318,400

TensorFlow PyTorch AWS GCP Azure SageMaker Vertex_AI MLflow SQL Diffusion_models Computer_vision Multimodal_models Video_generation User_behavior_analysis Feature_analytics MLOps Data_drift_tracking Version_control Testing Code_review

Save

Applied AI Scientist

Apple Inc

Culver City, CA 50 days ago $171,600–$302,200

TensorFlow PyTorch AWS GCP Azure SageMaker Vertex_AI MLflow SQL CNN RNN Transformers Diffusion_models Computer_vision Multimodal_models Video_generation Data_drift Model_monitoring Version_control Testing Code_review CI/CD

Save

Applied AI Scientist

Apple Inc

San Diego, CA 50 days ago $171,600–$302,200

Save

Applied AI Scientist

Apple Inc

Seattle, WA 50 days ago $171,600–$302,200

Save

Senior Staff AI Research Scientist

Intuit

Mountain View, CA 57 days ago $226,000–$306,000

Python PyTorch TensorFlow NeurIPS ICML ICLR AAAI KDD ACL Decision-focused AI Probabilistic modeling Causal inference Simulation-based planning Agentic and multi-agent systems Neuro-symbolic AI LLM-based reasoning Deep learning Optimization Statistical machine learning

Save

Distinguished AI Scientist

Intuit

Mountain View, CA 56 days ago $314,500–$425,500

Python TensorFlow PyTorch AWS GCP Azure MLOps Reinforcement Learning LLMs Multi-modal Models CI/CD Data Structures Algorithms A/B Testing Prometheus Grafana

Save