Senior Staff Machine Learning Engineer, Data & Eval - Careers

Airbnb

Remote

Quick summary

Work type
Remote
Location
Remote
Salary
$244,000–$305,000 / yr
Posted
1 day ago

Market check

Salary context

Above market

How this pay compares to similar roles

Similar $205k
This role $274k
$153k most similar roles pay here $321k

This role pays more than 89% of similar roles. Most pay $169,676–$239,500 — the shaded band above. At the midpoint, this role pays about $274k versus about $205k for comparable roles.

Based on 240 similar postings.

Employer

About Airbnb

Founded in 2008 and formerly known as AirBed & Breakfast, Inc., Airbnb is a global marketplace connecting travelers with hosts who offer unique accommodations, ranging from private rooms to entire homes. It operates a massive digital platform for booking stays, experiences, and travel services worldwide.

Airbnb currently has 75 open roles on FindRole.

Listed pay typically runs $204,000–$255,000 across 45 roles with salary data.

Most-posted roles

View all roles at Airbnb

At a glance

TL;DR · Senior Staff Machine Learning Engineer, Data & Eval - Careers

As a Senior Staff Machine Learning Engineer on Airbnb’s Core ML team, you will lead the technical direction and execution of ML evaluation systems for CSxAI products such as assistive agents and issue resolution tools. Your responsibilities include defining evaluation strategies, building scalable frameworks, designing data flywheels, and driving cross-functional quality initiatives to ensure continuous improvement. You will work with product, engineering, and design teams to create trusted, scalable, and actionable evaluation systems that connect offline metrics to online outcomes. The role requires deep expertise in evaluation methodology, hands-on experience with Generative AI systems, and proficiency in building data pipelines and quality systems. Ideal candidates have a PhD or equivalent industry experience, 10+ years of end-to-end ML/AI system development, and leadership experience in large technical initiatives.

What you'll do

  • Define evaluation strategy and success metrics for GenAI systems.
  • Build and scale evaluation frameworks with strong controls for bias and reliability.
  • Design the data flywheel to support continuous improvement of AI models.
  • Lead cross-functional quality initiatives across product, ops, and engineering teams.
  • Develop and productionize pipelines for dataset creation and model monitoring.

What we're looking for

  • PhD in Computer Science, Mathematics, Statistics, or related field (or equivalent experience).
  • 10+ years of hands-on experience building and shipping ML/AI systems end-to-end.
  • 2+ years of production experience with Generative AI/LLM systems.
  • 5+ years of leadership experience guiding technical initiatives as a senior individual contributor.
  • Deep expertise in evaluation methodology for offline and online alignment, metric design, human-in-the-loop evaluation, A/B testing, power analysis, regression testing.
  • Hands-on experience building data pipelines and quality systems including labeling workflows, dataset curation, versioning, monitoring, and governance.

More like this

Similar roles

Senior Staff Machine Learning Engineer, Post Training - Careers

Airbnb

Remote (San Francisco, CA, US) 1 day ago $248,000$310,000
Python PyTorch LLM fine-tuning LLM alignment Reinforcement learning Language model evaluation Multilingual modeling Multimodal modeling Data processing Model optimization Inference runtime Runtime optimizations Model quantization Compression On-device inference GPU inference Kernel development CI/CD
Remote

Senior Staff Machine Learning Engineer, Trust - Careers

Airbnb

Remote (San Francisco, CA, US) 1 day ago $244,000$305,000
Python Scala Java Tensorflow PyTorch Kubernetes AgenticAI CI/CD A/B testing API design Data pipelines Gradient boosted trees Neural networks Deep learning Feature engineering Anomaly detection Natural language processing Computer vision Recommendation systems Test-driven development
Remote

Senior Staff Machine Learning Engineer, Growth Platform Engineering - Careers

Airbnb

Remote (San Francisco, CA, US) 1 day ago $244,000$305,000
Python Scala Java Tensorflow PyTorch Kubernetes Airflow Apache Kafka CI/CD ML/AI Agile APIs Data Pipelines Feature Engineering A/B Testing Deep Learning Natural Language Processing Computer Vision Recommendation Systems Anomaly Detection Scalability Automation AI Orchestration
Remote