Senior Machine Learning Systems Engineer, Ads ML Experience Platform

Reddit

Remote

Quick summary

Work type: Remote
Location: Remote
Salary: $216,700–$303,400 / yr
Posted: 4 days ago

Market check

Salary context

Above market

How this pay compares to similar roles

[Chart: salary scale from $166k to $318k, marking similar roles at about $227k and this role at about $260k, with a shaded band where most similar roles pay.]

This role pays more than 78% of similar roles. Most pay $197,925–$256,803 — the shaded band above. At the midpoint, this role pays about $260k versus about $227k for comparable roles.

Based on 240 similar postings.

Employer

About Reddit

Reddit is a social news aggregation and discussion platform where users share content, vote on posts, and engage in community conversations across thousands of interest-based forums called subreddits.

Reddit currently has 118 open roles on FindRole.

Listed pay typically runs $217,000–$303,900 across 80 roles with salary data.

At a glance

TL;DR · Senior Machine Learning Systems Engineer, Ads ML Experience Platform

As a Senior Machine Learning Systems Engineer on the Ads ML Experience Platform team, you will design and build large-scale offline ML experimentation platforms that enable reproducible research and model development workflows. You’ll develop production-grade training orchestration frameworks that support distributed training and hyperparameter optimization, and build infrastructure for experiment tracking and artifact versioning. Partnering with ML engineers and researchers, you will improve operational efficiency and create automated workflows for continuous evaluation and compliance validation. The role calls for deep expertise in large-scale distributed systems, proficiency with modern orchestration technologies such as Kubeflow and Argo, and experience with distributed data processing systems such as Spark or Flink. The position is integral to advancing the Ads ML lifecycle at Reddit, accelerating model iteration through scalable platform services and intelligent automation.

What you'll do

  • Design and build large-scale offline ML experimentation platforms for reproducible research.
  • Develop production-grade training orchestration frameworks supporting distributed training and hyperparameter optimization.
  • Build infrastructure for experiment tracking, metadata management, lineage, artifact versioning, and model registries.
  • Create automated workflows for continuous evaluation, compliance validation, and model promotion/rollback.
  • Design an agentic AI execution platform with autonomous and human-in-the-loop workflows.

What we're looking for

  • 5+ years of experience in infrastructure/platform engineering or large-scale distributed systems.
  • 2+ years of hands-on experience building and operating production ML infrastructure, SDKs, APIs, or self-service AI tooling.
  • Experience with distributed data processing systems like Spark, Flink, Ray, or equivalent technologies.
  • Proficiency in modern orchestration and workflow technologies such as Kubeflow, Argo, Airflow, or similar frameworks.
  • Expertise in building offline ML experimentation platforms, model registries, experiment tracking systems, and training orchestration frameworks.

More like this

Similar roles

Senior Machine Learning Systems Engineer

Reddit

Remote · 24 days ago · $216,700–$303,400
Python, PyTorch, TensorFlow, Kubernetes, Ray, Apache Beam, Apache Spark, Ray Data, GCP, BigQuery, Google Cloud Storage, Terraform, MLflow, Wandb, Neo4j, JanusGraph, TigerGraph, PyTorch Geometric, Deep Graph Library

Senior Machine Learning Engineer, Ads Predictions

Apple Inc

Cupertino, CA · 72 days ago · $212,000–$318,400
Python, TensorFlow, PyTorch, SQL, Scala, Java, Kubernetes, Docker, CI/CD, Prometheus, Grafana, PostgreSQL, AWS, Azure, Google Cloud Platform, Redis, Elasticsearch, Hadoop, Spark, Git, Jenkins, GitHub, Bitbucket

Machine Learning Engineer, Special Projects

Apple Inc

Seattle, WA · 72 days ago · $139,500–$258,100
Python, CI/CD, SQL, Airflow, Prefect, Ray, Kubernetes, Terraform, Docker, Prometheus, Grafana, PostgreSQL, AWS, Azure, Google Cloud Platform, Git, Jenkins, Travis CI, Maven, Gradle, Ansible, Chef, MongoDB, Redis, Cassandra, Hadoop, Spark, TensorFlow, PyTorch, Scikit-learn, Pandas, NumPy, Matplotlib, Seaborn, Flask, Django, FastAPI, React, Vue.js, Angular, Node.js, Express.js

Machine Learning Engineer, Special Projects

Apple Inc

Santa Clara, CA · 72 days ago · $147,400–$272,100
Python, CI/CD, SQL, Airflow, Prefect, Ray, Kubernetes, Terraform, Docker, Prometheus, Grafana, PostgreSQL, AWS, Azure, Google Cloud Platform, Git, Jenkins, Travis CI, Maven, Gradle, Ansible, Chef, MongoDB, Redis, Cassandra, Hadoop, Spark, TensorFlow, PyTorch, Scikit-learn, Pandas, NumPy, Matplotlib, Seaborn, Flask, Django, FastAPI, React, Vue.js, Angular, Node.js, Express.js

Senior Machine Learning Engineering Manager, Ad Platforms

The Walt Disney Company

Remote (Seattle, WA) +1 · 25 days ago · $207,400–$278,100
Python, TensorFlow, PyTorch, Hugging Face, RAG, prompt engineering, AWS, MLOps, Docker, Kubernetes, CI/CD, PostgreSQL, Azure, GCP, Java, Prometheus, Grafana, Git, Jenkins