Software Engineer L4/L5, Model Serving Systems, Machine Learning Platform

Netflix

Actively hiring
Remote (Usa - Remote, US) Posted 126 days ago $466,000$750,000 / year

At a glance

AI generated

TL;DR

Join the Model Serving Systems team as an experienced engineer to develop scalable infrastructure for machine learning applications at Netflix. Your role will involve building and expanding compute resources to support large language models (LLMs) and other AI innovations, ensuring high availability and performance in real-time model inference and serving platforms. You’ll work closely with cross-functional teams including data scientists and product managers to drive ML/AI initiatives across the company. Proficiency in Java, experience with tools like Triton Inference Server, TensorRT, Docker, and public cloud services such as AWS, Azure, or GCP is essential. This role demands a passion for scalable systems and a commitment to delivering high-quality solutions in a fast-paced, multidisciplinary environment.

Skills

AWS Triton Inference Server TensorRT Docker Java Python CI/CD Kubernetes Prometheus Grafana PostgreSQL LLMs Model Serving Infrastructure

What you'll do

  • Develop and expand compute infrastructure to support growing AI needs.
  • Build scalable model-serving solutions for large language models (LLMs).
  • Reduce latency and costs in deploying ML models at scale.
  • Streamline research-to-production workflows for ML/AI applications.
  • Manage deployment, performance tuning, and capacity planning of ML systems.

What we're looking for

  • Experience building high-traffic distributed services for online ML model inference.
  • Proficient in object-oriented programming (Java) with production hosting expertise.
  • Understanding of scalable model-serving solutions for generative models and LLMs.
  • Familiarity with deploying ML models using Triton Inference Server, TensorRT, Docker.
  • Experience working with public cloud platforms like AWS, Azure, or GCP.
  • Proactive in promoting observability and logging best practices.

Market check

Salary context

This $466,000–$750,000 range sits above 100% of similar postings on FindRole.

Peer median band

$159,500$240,700

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$161,875$244,000

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Netflix

Netflix is the world''s leading streaming entertainment service, offering a vast library of TV series, films, documentaries, and original content to subscribers in over 190 countries. Industry: Streaming Entertainment & Media

Netflix currently has 91 open roles on FindRole.

Listed pay typically runs $388,000–$610,000 across 87 roles with salary data.

Most-posted roles

View all roles at Netflix

More like this

Similar roles

Machine Learning Software Engineer

Cornell University

Remote (Ithaca (Tompkins County), US) 30 days ago $98,548$114,529
Python Tensorflow Pytorch numpy pandas AWS GCP Azure Postgres Apache HTTP Server Apache Tomcat WordPress GeoServer Jenkins Firebase HubSpot Linux CI/CD
Remote

Senior Systems Software Engineer, Machine Learning

Nvidia

Us, Ca, Santa Clara, US 23 days ago $152,000$241,500
Python C/C++ Linux Unix CI/CD Docker Kubernetes AWS TensorFlow PyTorch PostgreSQL MongoDB 3D_Computer_Vision Generative_AI LLMs VLMs Multi-Agent_Systems Computer_Vision Deep_Learning

Senior Software Engineer, Machine Learning Inference

Nvidia

Us, Ca, Santa Clara, US 48 days ago $152,000$241,500
C++ Python CUDA Rust TensorRT TensorRT-LLM vLLM SGLang PyTorch JAX Deep Learning Frameworks GPU Programming Performance Analysis Optimization Techniques CI/CD