Software Engineer 4 – Data and Feature Infrastructure, AI Platform

Netflix

Remote

Quick summary

Work type
Remote
Location
Remote
Salary
$466,000–$750,000 / yr
Posted
9 days ago

Market check

Salary context

Above market

How this pay compares to similar roles

Similar $199k
This role $608k
$81k most similar roles pay here $822k

This role pays more than 99% of similar roles. Most pay $162,000–$235,750 — the shaded band above. At the midpoint, this role pays about $608k versus about $199k for comparable roles.

Based on 240 similar postings.

Employer

About Netflix

Netflix is the world''s leading streaming entertainment service, offering a vast library of TV series, films, documentaries, and original content to subscribers in over 190 countries. Industry: Streaming Entertainment & Media

Netflix currently has 111 open roles on FindRole.

Listed pay typically runs $388,000–$619,000 across 109 roles with salary data.

Most-posted roles

View all roles at Netflix

At a glance

TL;DR · Software Engineer 4 – Data and Feature Infrastructure, AI Platform

As a Senior Machine Learning Engineer on the AI Platform team, you will design and build a next-generation data and feature infrastructure to enhance ML model performance across personalized recommendations, payments, games, ads, and other domains. Your daily tasks include developing a near-real-time feature computation engine for high-throughput training and low-latency inference, managing feature pipelines and serving infrastructure, and creating frameworks to expedite new data availability. You will also build scalable systems that accelerate training through efficient data handling and develop feature stores enabling discovery and sharing among various ML domains. This role requires experience in building ML or data infrastructure, proficiency with large-scale data processing tools like Spark and Flink, and expertise in Java or Python codebases, alongside a passion for delivering exceptional user experiences to ML practitioners.

What you'll do

  • Design and build a near-real-time feature computation engine for high-throughput training and low-latency inference.
  • Operate and manage feature computation pipelines and serving infrastructure across multiple ML domains.
  • Create frameworks to streamline data availability for training and serving new features.
  • Develop scalable systems that accelerate training through efficient data loading and transformation.
  • Build centralized feature stores enabling discovery and sharing of datasets across various business areas.
  • Collaborate with domain experts to ensure high-quality features and labels for ML models.

What we're looking for

  • Extensive experience in building and managing ML infrastructure.
  • Proficient in large-scale data processing frameworks like Spark, Flink, Kafka.
  • Expertise in data storage technologies including Iceberg and Cassandra.
  • Strong background in optimizing Java and Python codebases for performance.
  • Hands-on experience with public cloud platforms, particularly AWS.
  • Passionate about enhancing user experiences for ML practitioners.

More like this

Similar roles

Software Engineer 5 – Model Serving Systems, AI Platform

Netflix

Remote (Usa - Remote, US) 9 days ago $466,000$750,000
AWS Triton Inference Server TensorRT Docker Java Python Kubernetes CI/CD LLMs Model Serving Infrastructure High Availability Performance Tuning Deployment Management Capacity Planning Observability Logging
Remote

Software Engineer, Ai & Data Platforms

Apple Inc

Austin, TX 42 days ago
Python Go Docker AWS Azure Google Cloud Kubernetes Terraform VS Code TypeScript Node.js JetBrains IDEs IntelliJ Platform SDK Git CI/CD Prometheus Grafana

Software Engineer, Ai & Data Platforms

Apple Inc

Austin, TX 55 days ago
Python Go Docker AWS Azure Google Cloud Kubernetes Terraform VS Code TypeScript Node.js JetBrains IDEs IntelliJ Platform SDK CI/CD Git

Senior Software Development Engineer, AI Data Platform

Oracle

US 61 days ago $79,200$178,100
Python RESTful APIs Oracle Cloud Infrastructure (OCI) Spark PySpark Delta Lake Parquet MLOps Flink CI/CD LLMs agentic frameworks MCP LangChain Semantic Kernel RAG pipelines vector DBs Oracle 26ai FAISS Pinecone Weaviate Hadoop Kafka NoSQL