Staff Machine Learning Engineer, AI Serving

Reddit

Remote

Quick summary

Work type
Remote
Location
Remote
Salary
$253,300–$354,600 / yr
Posted
today

Market check

Salary context

Above market

How this pay compares to similar roles

Similar $210k
This role $304k
$150k most similar roles pay here $377k

This role pays more than 96% of similar roles. Most pay $172,000–$247,875 — the shaded band above. At the midpoint, this role pays about $304k versus about $210k for comparable roles.

Based on 240 similar postings.

Employer

About Reddit

Reddit is a social news aggregation and discussion platform where users share content, vote on posts, and engage in community conversations across thousands of interest-based forums called subreddits.

Reddit currently has 72 open roles on FindRole.

Listed pay typically runs $217,000–$303,900 across 66 roles with salary data.

Most-posted roles

View all roles at Reddit

At a glance

TL;DR · Staff Machine Learning Engineer, AI Serving

As a Staff Machine Learning Engineer on Reddit’s Machine Learning Platform team, you will lead the development of a large-scale ML Inference Platform, designing and implementing highly available GPU-based model serving systems for search, ranking, and LLMs supporting millions of queries per second. You’ll develop feature hydration and processing systems, create a unified GPU model export framework, and build an E2E inference performance benchmarking tool. The role requires deep experience with Kubernetes, cloud technologies like AWS and Google Cloud Storage, and proficiency in Python alongside modern AI/ML frameworks such as Triton, Dynamo, vLLM, and PyTorch. You will work on complex multi-cluster compute environments and ensure real-time ML observability for model performance tracking.

What you'll do

  • Lead the design and implementation of a highly available, low-latency GPU-based model serving system.
  • Develop ML and Generative AI systems in cloud environments on Kubernetes at scale.
  • Create prototypes for high-performance feature hydration and processing systems.
  • Design and implement a unified GPU model export framework for optimized inference models.
  • Build an end-to-end inference performance benchmarking framework.

What we're looking for

  • 7+ years of experience in ML Engineering or related roles.
  • Deep expertise in Kubernetes and cloud-based technologies for AI platforms.
  • Strong proficiency in Python and modern AI/ML frameworks like PyTorch, Triton.
  • Experience designing and implementing large-scale GPU-based model serving systems.
  • Knowledge of real-time ML observability and performance benchmarking.

More like this

Similar roles

Staff Machine Learning Engineer, Compute

General Motors (GM)

Remote (Gm Automation - Sunnyvale - Gm Automation - Sunnyvale, US) 94 days ago $198,900$304,800
Python Kubernetes GCP Azure AWS Go C++ Docker CI/CD PyTorch Ray Prometheus Grafana PostgreSQL Redis GitLab GitHub MESOS YARN
Remote

Staff Machine Learning Engineer

Intuit

Mountain View, CA 25 days ago $202,500$274,000
Python AWS GCP Kubernetes Docker CI/CD Terraform PostgreSQL MySQL Redis Hadoop Spark TensorFlow PyTorch Scikit-learn Pandas NumPy Jupyter Git

Senior Staff Machine Learning Engineer, AI Agent Platform

GEICO

New York City, NY 56 days ago $130,000$300,000
Python Java Go C++ Kubernetes CI/CD MCP A2A LLM GPU Elasticsearch Qdrant Kafka PostgreSQL MongoDB Spark Ray Temporal Redis Neo4j Azure AWS LangGraph AutoGen CrewAI LangSmith Arize_Phoenix Langfuse RAG RBAC ABAC

Staff Machine Learning Engineer

Intuit

Mountain View, CA 52 days ago $202,500$274,000
Python Scikit-learn NLTK NumPy Pandas TensorFlow Keras R Spark SQL Git AWS GCP CI/CD

Staff Machine Learning Engineer

Arm Holdings

Austin, TX 52 days ago $249,900$338,100
Python TensorFlow PyTorch GPU ARM ML Model Optimization Deep Learning Computer Architecture CI/CD
Hybrid