Senior Staff Machine Learning Engineer, GenAI Platform

Reddit

Remote

Quick summary

Work type
Remote
Location
Remote
Salary
$292,500–$409,500 / yr
Posted
today

Market check

Salary context

Above market

How this pay compares to similar roles

Similar $218k
This role $351k
$150k most similar roles pay here $437k

This role pays more than 98% of similar roles. Most pay $177,900–$259,053 — the shaded band above. At the midpoint, this role pays about $351k versus about $218k for comparable roles.

Based on 240 similar postings.

Employer

About Reddit

Reddit is a social news aggregation and discussion platform where users share content, vote on posts, and engage in community conversations across thousands of interest-based forums called subreddits.

Reddit currently has 72 open roles on FindRole.

Listed pay typically runs $217,000–$303,900 across 66 roles with salary data.

Most-posted roles

View all roles at Reddit

At a glance

TL;DR · Senior Staff Machine Learning Engineer, GenAI Platform

As a Senior Staff Machine Learning Engineer on Reddit’s Machine Learning Platform team, you will lead the vision and strategy for Reddit’s large-scale GenAI Platform, focusing on designing and implementing features like unified API endpoints, rate/token limit management, and intelligent failover mechanisms. You’ll work with Kubernetes and cloud technologies such as AWS and Google Cloud Storage to ensure reliability and scalability of generative AI products across the company. Proficiency in Go, Python, and other ML frameworks is essential, along with strong communication skills for articulating technical concepts to non-technical stakeholders. This role demands deep expertise in MLOps, LLMOps standards, observability, and operational excellence to drive platform investments aligned with user needs and business priorities.

What you'll do

  • Lead and execute the vision, strategy, and roadmap for Reddit’s large-scale GenAI Platform.
  • Define platform architecture enabling teams to build, deploy, and scale generative AI products reliably.
  • Drive the strategy for a unified LLM Gateway supporting consistent APIs for internal/externally hosted models.
  • Set direction for core platform capabilities like rate/token limit management and intelligent failover mechanisms.
  • Champion MLOps and LLMOps standards across CI/CD, testing, versioning, evaluation, and lifecycle management.

What we're looking for

  • 10+ years of experience in ML Engineering or AI Platform roles.
  • Proven track record of leading technical strategy and delivering AI platforms at scale.
  • Deep expertise in Kubernetes and other orchestration systems for large-scale production environments.
  • Proficiency with cloud-based technologies like AWS, Google Cloud Storage, Terraform, etc.
  • Strong knowledge of model serving, inference pipelines, monitoring, and observability for AI systems.
  • Excellent communication skills to articulate technical concepts to non-technical stakeholders.

More like this

Similar roles

Staff Machine Learning Engineer, AI Serving

Reddit

Remote (San Francisco, CA, US) today $253,300$354,600
Kubernetes AWS Terraform Python Go Docker CI/CD Prometheus Grafana PostgreSQL Trino LLMs Triton Dynamo vLLM Pytorch
Remote

Senior Staff Machine Learning Engineer

Intuit

Mountain View, California 48 days ago $214,000$289,500
AWS GCP TensorFlow PyTorch Spark Kubernetes MLflow RAG LLM CI/CD MLOps Python Docker Prometheus PostgreSQL

Senior Staff Machine Learning Engineer

GEICO

Bethesda 37 days ago $150,000$300,000
Python AWS Azure Kubernetes Airflow Snowflake PostgreSQL MongoDB Cassandra Spark Ray MLflow Kubeflow Feast Prometheus Grafana OpenTelemetry CI/CD ElasticSearch Qdrant Parquet Delta Iceberg Flink SHAP LIME

Senior Staff Machine Learning Engineer

GEICO

Palo Alto, CA 44 days ago $150,000$300,000
Python Java C++ AWS Azure Kubernetes CI/CD Elasticsearch Snowflake Kafka PostgreSQL MongoDB Cassandra Spark Ray Airflow Temporal LLMs GPT Generative AI

Senior Staff Machine Learning Engineer

GEICO

Bethesda 37 days ago $150,000$300,000
Python Java C++ AWS Azure Kafka Spark Ray Airflow Temporal PostgreSQL MongoDB Cassandra ElasticSearch Qdrant Snowflake Parquet Delta Iceberg MLflow Kubeflow Feast Prometheus Grafana OpenTelemetry CI/CD Kubernetes