Staff/Sr. Machine Learning Engineer, Foundation Models - AI, Search & Knowledge Platforms

Apple Inc

Quick summary

Work type: On-site
Location: Seattle, WA
Salary: $171,600–$302,200 / yr
Posted: 38 days ago

Market check

Salary context

Competitive pay

How this pay compares to similar roles

Similar roles (midpoint): $228k
This role (midpoint): $237k
Range across similar roles: $156k–$318k

This role pays more than 58% of similar roles, most of which pay $196,562–$260,050. At the midpoint, this role pays about $237k, versus about $228k for comparable roles.

Based on 240 similar postings.

Employer

About Apple Inc

Apple Inc. is a multinational technology company known for designing and manufacturing consumer electronics, software, and online services, including the iPhone, Mac, iPad, and App Store. Industry: Consumer Electronics & Software

Apple Inc currently has 969 open roles on FindRole.

Listed pay typically runs $163,300–$272,100 across 756 roles with salary data.


At a glance

TL;DR · Staff/Sr. Machine Learning Engineer, Foundation Models - AI, Search & Knowledge Platforms

Join the Foundation Model Inference Team within AI, Search & Knowledge Platforms as a Staff/Senior Machine Learning Engineer to build and optimize inference stacks for Apple's largest foundation models. You will collaborate with research teams to enhance model architectures, work closely with product teams to deploy production-grade solutions serving millions of users in real-time, and develop tools to identify performance bottlenecks across various hardware configurations. Additionally, you will mentor engineers within the organization. The role requires expertise in GPU programming with CUDA, proficiency in ML frameworks like PyTorch or TensorFlow, and experience with high-throughput services at supercomputing scale. Familiarity with Nvidia TensorRT-LLM, vLLM, DeepSpeed, and Triton Server is preferred, as well as skills in modern languages such as Golang and Python.

What you'll do

  • Optimize inference for cutting-edge model architectures alongside the research team.
  • Develop production-grade solutions to launch models serving millions of customers in real time.
  • Create tools and simulators to identify bottlenecks in inference across various hardware.
  • Mentor engineers within the organization to enhance their skills and knowledge.
  • Work on high-throughput services at supercomputing scale for efficient model deployment.

What we're looking for

  • 5+ years of experience leading complex projects
  • Expertise in the LLM inference stack and GPU programming with CUDA
  • Proficiency in ML frameworks like PyTorch or TensorFlow
  • Experience with high-throughput services at supercomputing scale
  • BS degree in Computer Science, AI, Machine Learning, Information Retrieval, or Data Science

More like this

Similar roles

Staff Machine Learning Engineer - Applied AI

Uber

San Francisco, CA · 31 days ago · $232,000
Python, PyTorch, Distributed Training, Transformers, Retrieval Systems, Ranking, Embedding Architectures, Kubernetes, AWS, CI/CD, PostgreSQL, Mentorship, Technical Leadership

Senior Staff Machine Learning Engineer, Search & Discovery

SpaceX

Remote (US) · 81 days ago · $313,000–$330,500
Machine Learning, Large Language Models, Agentic AI Systems, Recommendation Systems, Ranking Models, Embeddings, Representation Learning, Search Systems, Generative AI, CI/CD, Python, Scalability, Real-Time Systems, Experimentation, Cloud Services, Docker, Kubernetes, Terraform