Research Scientist / Engineer, Foundation Model Evaluation

Apple Inc

Quick summary

Work type: On-site
Location: Cupertino, CA
Salary: $181,100–$318,400 / yr
Posted: 56 days ago
Nearby: 99+ roles within 25 mi

Market check

Salary context

Above market

How this pay compares to similar roles

Chart: similar roles about $194k; this role about $250k; scale from $121k to $340k, with a shaded band marking where most similar roles pay.

This role pays more than 79% of similar roles. Most pay $142,450–$245,900 — the shaded band above. At the midpoint, this role pays about $250k versus about $194k for comparable roles.

Based on 240 similar postings.
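The midpoint comparison above can be checked with a few lines of arithmetic. This is a minimal sketch, not the site's actual method: it assumes the "about $194k" figure is the midpoint of the $142,450–$245,900 band, which the listing does not state outright, and the `midpoint` helper and variable names are illustrative.

```python
def midpoint(low, high):
    """Return the midpoint of a pay range in dollars."""
    return (low + high) / 2


# Listed salary range for this role.
role_mid = midpoint(181_100, 318_400)

# Band where most similar roles pay (assumed basis for the ~$194k figure).
similar_mid = midpoint(142_450, 245_900)

# Absolute and relative gap between the two midpoints.
premium = role_mid - similar_mid
premium_pct = premium / similar_mid * 100

print(f"This role midpoint:     ${role_mid:,.0f}")
print(f"Similar roles midpoint: ${similar_mid:,.0f}")
print(f"Difference:             ${premium:,.0f} ({premium_pct:.1f}%)")
```

Under that assumption the midpoints come out to $249,750 and $194,175, which round to the $250k and $194k shown in the listing.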

Employer

About Apple Inc

Apple Inc. is a multinational technology company known for designing and manufacturing consumer electronics, software, and online services, including the iPhone, Mac, iPad, and App Store.

Industry: Consumer Electronics & Software

Apple Inc currently has 1723 open roles on FindRole.

Listed pay typically runs $162,500–$272,100 across 1398 roles with salary data.

At a glance

TL;DR · Research Scientist / Engineer, Foundation Model Evaluation

As a Research Scientist or Engineer on the Foundation Model Evaluation team at Apple, you will help improve the models that power products people use every day. You will design and implement rigorous evaluation systems for large language models across reasoning, knowledge, code, and agentic workflows, and ensure that these evaluations reflect real user experiences. You will work closely with model training and product teams to feed actionable insights back into the development cycle. The ideal candidate has a strong background in AI model evaluation, NLP, and machine learning, and is proficient in Python and ML frameworks such as PyTorch or JAX. Experience in experimental design, human evaluation methodology, and building reusable evaluation tooling is highly valued. The work addresses evaluation challenges at scale and affects the quality of products used by billions of people worldwide.

Skills

Python, PyTorch, JAX, ML frameworks, NLP, Statistical analysis, Experimental design, Technical communication, CI/CD

What you'll do

Design and implement evaluation benchmarks for AI models.
Develop methods to measure model performance in real product settings.
Research and apply state-of-the-art evaluation techniques and build tools.
Execute rigorous experiments comparing model capabilities and analyze results.
Translate research insights into actionable recommendations for training strategies.

What we're looking for

3+ years of experience in AI model evaluation, NLP, or a related field
Strong machine learning, NLP, and statistical analysis fundamentals
Proficiency in Python and ML frameworks like PyTorch or JAX
Ability to translate research insights into practical implementations
Expertise in designing rigorous experiments and drawing valid conclusions

Similar roles

Principal Applied Research Engineer/Scientist

Apple Inc

Cupertino, CA · 88 days ago · $212,000–$386,300

Python, PyTorch, TensorFlow, Transformer, LLMs, Multi-modal models, Cross-modal attention, CI/CD, MLOps, AWS, Kubernetes, Docker, Git, Scikit-learn, PostgreSQL, Redis, NVIDIA GPU, CUDA, AutoML, Ray

AIML Researcher/Engineer - Foundation Model Post-Training

Apple Inc

Cupertino, CA · 15 days ago

Python, PyTorch, JAX, Reinforcement Learning, LLMs, Distributed Training, Transformers, Curriculum Learning, Evaluation Methodologies, Deep Learning, Data Generation, Automated Data Filtering

AIML Researcher/Engineer - Foundation Model Post-Training

Apple Inc

Seattle, WA · 8 days ago

Python, PyTorch, JAX, Reinforcement Learning, LLMs, Distributed Training, Transformers Architecture, Curriculum Learning, Evaluation Methodologies, Deep Learning, CI/CD

AIML Researcher/Engineer - Foundation Model Post-Training

Apple Inc

New York City, NY · 15 days ago

Python, PyTorch, JAX, Reinforcement Learning, LLMs, Distributed Training, Transformers, Curriculum Learning, Evaluation Methodologies, Data Generation, Automated Data Filtering

Senior Research Engineer, Foundation Model Training Infrastructure

Nvidia

Santa Clara, CA · 154 days ago · $224,000–$356,500

PyTorch, TensorFlow, JAX, Kubernetes, Python, C++, CUDA, SLURM, HPC, GPU, Distributed Systems, Multimodal Data Processing, Monitoring Tools, Debugging Tools, Large Scale Clusters, CI/CD

Research Engineer

Adobe

Seattle · 85 days ago · $146,300–$211,850

Python, PyTorch, TensorFlow, JAX, C++, TensorRT, AITemplate, CoreML, WinML, TensorFlow Lite, ONNXRuntime, Diffusion models, Neural network pruning, Knowledge distillation, Quantization, Architecture search, Sub-quadratic attention optimization, Sparse mixture of experts, Cloud deployment, Mobile deployment
