Large Machine Learning Model Optimization Engineer, SIML

Apple Inc

Quick summary

Work type: On-site
Location: Seattle, WA
Salary: $139,500–$258,100 / yr
Posted: 28 days ago

Market check

Salary context

Competitive pay

How this pay compares to similar roles

Similar roles: $222k
This role: $199k
Chart scale: $124k to $280k, with a shaded band marking where most similar roles pay

About 64% of similar roles pay more than this one. Most pay $194,000–$249,750, the shaded band above. At the midpoint, this role pays about $199k versus about $222k for comparable roles.

Based on 239 similar postings.
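The midpoint figures above can be checked with simple arithmetic. The sketch below is illustrative only: it assumes the "$222k" figure for similar roles is the midpoint of the $194,000–$249,750 band, which the listing does not state explicitly, and the function name `midpoint` is a hypothetical helper rather than anything used by the site.

```python
def midpoint(low: float, high: float) -> float:
    """Return the midpoint of a salary range."""
    return (low + high) / 2


# Listed range for this role.
this_role = midpoint(139_500, 258_100)

# Band where most similar roles pay (assumed to be the source of the $222k figure).
similar_band = midpoint(194_000, 249_750)

# Absolute and relative gap between the two midpoints.
gap = similar_band - this_role
gap_pct = gap / similar_band * 100

print(f"This role midpoint:    ${this_role:,.0f}")
print(f"Similar band midpoint: ${similar_band:,.0f}")
print(f"Gap: ${gap:,.0f} ({gap_pct:.1f}% below the similar-role midpoint)")
```

Under that assumption, the two midpoints round to the $199k and $222k shown in the comparison.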

Employer

About Apple Inc

Apple Inc. is a multinational technology company known for designing and manufacturing consumer electronics, software, and online services, including the iPhone, Mac, iPad, and App Store.

Industry: Consumer Electronics & Software

Apple Inc currently has 638 open roles on FindRole.

Listed pay typically runs $171,600–$272,100 across 505 roles with salary data.

At a glance

TL;DR · Large Machine Learning Model Optimization Engineer, SIML

As a Machine Learning Model Optimization Engineer on Apple’s SIML team, you will join a research and engineering group working on real-time, on-device technologies in language, computer vision, and machine perception. You will implement optimization techniques that improve the on-device performance of large language and diffusion models, work with teams across Apple to integrate these optimizations into user experiences, and contribute to the broader ML model lifecycle. The work relies on Python and on hardware-aware model optimizations, including quantization, pruning, and distillation. The role calls for experience with large-scale machine learning projects and distributed inference, and a track record of publishing novel research at top conferences.

What you'll do

  • Develop and implement model compression techniques for large language and diffusion models.
  • Optimize machine learning models to enhance real-time performance on Apple devices.
  • Lead the execution of hardware-aware model optimizations across various projects.
  • Collaborate with cross-functional teams to integrate ML models into user experiences.
  • Publish novel research in top machine learning conferences and journals.
  • Drive the development of efficient ML model deployment strategies for on-device use.

What we're looking for

  • Experience developing large computer vision and machine learning models.
  • Proficiency in hardware-aware model optimizations for efficient deployment.
  • Familiarity with model compression techniques such as quantization, pruning, and distillation.
  • An MS or PhD in Computer Science, or equivalent industry experience.
  • Experience leading large-scale projects and driving innovation in the field.
  • Strong software engineering skills in Python and knowledge of ML compilers.
  • Expertise in high-performance kernel implementation and distributed inference.

More like this

Similar roles

Machine Learning Research Engineer

Booz Allen Hamilton

Springfield, VA · 4 days ago · $99,000–$225,000
PyTorch Transformer-based models Self-supervised learning Multi-task learning Docker CI/CD Python PostgreSQL Git GitHub Jupyter Notebook TensorFlow Kubernetes AWS Google Cloud Platform Azure Machine Learning Hyperspectral data Uncertainty estimation Conformal prediction OOD detection Masked autoencoders Contrastive learning Retrieval models Multimodal alignment

Machine Learning Engineer II

GEICO

Remote (Bethesda, MD) · 9 days ago · $105,000–$215,000
Python TensorFlow PyTorch Scikit-learn AWS Azure GCP CI/CD Kubernetes Docker SQL Spark Version Control ETL Kafka
Remote Hybrid

Machine Learning Engineer

Adobe

San Jose · 2 days ago · $161,700–$234,150
Python AWS GCP Azure MLOps CI/CD Docker Kubernetes Prometheus Terraform PostgreSQL Git Agentic systems Multi-agent orchestration LLM-as-a-judge Retrieval-Augmented Generation RAG NLP pipelines

Machine Learning Engineer

Motorola Solutions

Los Angeles, CA · 54 days ago · $120,000–$160,000
Python TensorFlow PyTorch scikit-learn MATLAB C++ signal processing wireless communication MIMO OFDM SDRs GPU acceleration embedded machine learning real-time systems adaptive modulation beamforming cognitive radio techniques 3GPP IEEE 802.11/15 military waveforms
Hybrid