ML Engineer - Automated Evaluation and Adversarial Design

Apple Inc

Quick summary

Work type
On-site
Location
Culver City, CA
Salary
$139,500–$258,100 / yr
Posted
44 days ago

Market check

Salary context

Competitive pay

How this pay compares to similar roles

Similar $215k
This role $199k
$125k most similar roles pay here $272k

This role pays less than 60% of similar roles. Most pay $180,327–$249,750 — the shaded band above. At the midpoint, this role pays about $199k versus about $215k for comparable roles.

Based on 240 similar postings.

Employer

About Apple Inc

Apple Inc. is a multinational technology company known for designing and manufacturing consumer electronics, software, and online services, including the iPhone, Mac, iPad, and App Store. Industry: Consumer Electronics & Software

Apple Inc currently has 638 open roles on FindRole.

Listed pay typically runs $171,600–$272,100 across 505 roles with salary data.

Most-posted roles

View all roles at Apple Inc

At a glance

TL;DR · ML Engineer - Automated Evaluation and Adversarial Design

As a Senior ML Engineer on the Automated Evaluation and Adversarial Design team, you will focus on building and scaling automated evaluation systems to assess AI feature quality at scale, including multi-turn conversation evaluations and end-to-end agent workflow testing. Your day-to-day responsibilities include designing adversarial test suites that probe model weaknesses and executing stress tests under demanding conditions to ensure features meet performance thresholds for hundreds of millions of users. You will develop evaluation frameworks, rubrics, and reports while collaborating with engineering partners to integrate these systems into development workflows. The role requires expertise in Python, ML frameworks like PyTorch or TensorFlow, and experience with adversarial testing methodologies across multi-turn interactions. Familiarity with productivity software, agent orchestration frameworks, and observability tooling is also beneficial as you work on shaping the evaluation infrastructure for AI features that influence product launches and model development decisions.

What you'll do

  • Define and own automated evaluation approaches for AI features, ensuring measurable assessments.
  • Build adversarial test suites targeting model failure modes across single-turn and multi-turn interactions.
  • Develop stress test protocols to validate performance under atypical input conditions, including extended conversations.
  • Ensure alignment between automated and human evaluation methods by resolving systematic disagreements.
  • Scale adversarial test case generation and execute stress tests using automation for efficiency.
  • Influence model and feature quality decisions by communicating evaluation findings to cross-functional teams.

What we're looking for

  • Bachelor’s degree in Computer Science, Machine Learning, Statistics, or related field.
  • 4+ years of experience building ML evaluation systems and designing evaluation benchmarks.
  • Experience defining evaluation architecture for AI systems with conversation-level analysis units.
  • Expertise in designing adversarial test methodologies targeting multi-turn interaction failures.
  • Proficiency with Python and ML frameworks (PyTorch, TensorFlow) in production settings.

More like this

Similar roles

AI/ML Engineer

Lam Research

Fremont, CA 64 days ago $119,000$261,000
Python C++ PostgreSQL SQLite MySQL Git Domain-Driven Design Test-Driven Development CI/CD
Hybrid

AI/ML Engineer

Booz Allen Hamilton

Norfolk, VA 3 days ago
Spark Hadoop Databricks Python Java Scala R TensorFlow Keras PyTorch CI/CD MLOps Git Jupyter Notebook PostgreSQL MongoDB AWS Azure Google Cloud Platform Kubernetes Docker

Machine Learning Engineer

Motorola Solutions

Los Angeles, CA 54 days ago $120,000$160,000
Python TensorFlow PyTorch scikit-learn MATLAB C++ signal processing wireless communication MIMO OFDM SDRs GPU acceleration embedded machine learning real-time systems adaptive modulation beamforming cognitive radio techniques 3GPP IEEE 802.11/15 military waveforms
Hybrid