Research Scientist / Engineer, Foundation Model Evaluation

Apple Inc

Quick summary

Work type
On-site
Location
Cupertino, CA
Salary
$181,100–$318,400 / yr
Posted
56 days ago

Market check

Salary context

Above market

How this pay compares to similar roles

Similar $194k
This role $250k
$121k most similar roles pay here $340k

This role pays more than 79% of similar roles. Most pay $142,450–$245,900 — the shaded band above. At the midpoint, this role pays about $250k versus about $194k for comparable roles.

Based on 240 similar postings.

Employer

About Apple Inc

Apple Inc. is a multinational technology company known for designing and manufacturing consumer electronics, software, and online services, including the iPhone, Mac, iPad, and App Store. Industry: Consumer Electronics & Software

Apple Inc currently has 1723 open roles on FindRole.

Listed pay typically runs $162,500–$272,100 across 1398 roles with salary data.

Most-posted roles

View all roles at Apple Inc

At a glance

TL;DR · Research Scientist / Engineer, Foundation Model Evaluation

As a Research Scientist or Engineer in the Foundation Model Evaluation team at Apple, you will play a critical role in enhancing models that power daily-used products. Your responsibilities include designing and implementing rigorous evaluation systems for large language models across reasoning, knowledge, code, and agentic workflows, ensuring these evaluations reflect real user experiences. You will collaborate closely with model training and product teams to integrate actionable insights into the development cycle, driving continuous improvement. The ideal candidate has a strong background in AI model evaluation, NLP, and machine learning, with proficiency in Python and ML frameworks like PyTorch or JAX. Experience in experimental design, human evaluation methodology, and building reusable evaluation tooling is highly valued. This role involves working on cutting-edge technologies to address complex challenges at scale, impacting the quality of products used by billions globally.

What you'll do

  • Design and implement evaluation benchmarks for AI models.
  • Develop methods to measure model performance in real product settings.
  • Research and apply state-of-the-art evaluation techniques and build tools.
  • Execute rigorous experiments comparing model capabilities and analyze results.
  • Translate research insights into actionable recommendations for training strategies.

What we're looking for

  • 3+ years of AI model evaluation, NLP, or related field experience
  • Strong machine learning, NLP, and statistical analysis fundamentals
  • Proficiency in Python and ML frameworks like PyTorch or JAX
  • Ability to translate research insights into practical implementations
  • Expertise in designing rigorous experiments and drawing valid conclusions

More like this

Similar roles

Principal Applied Research Engineer/Scientist

Apple Inc

Cupertino, CA 88 days ago $212,000$386,300
Python PyTorch TensorFlow Transformer LLMs Multi-modal_models Cross-modal_attention CI/CD MLOps AWS Kubernetes Docker Git Scikit-learn PostgreSQL Redis NVIDIA_GPU CUDA AutoML Ray

Research Engineer

Adobe

Seattle 85 days ago $146,300$211,850
Python PyTorch TensorFlow JAX C++ TensorRT AITemplate CoreML WinML TensorFlow Lite ONNXRuntime Diffusion models Neural network pruning Knowledge distillation Quantization Architecture search Sub-quadratic attention optimization Sparse mixture of experts Cloud deployment Mobile deployment