Senior Research Manager, World Model Evaluation

Nvidia

Quick summary

Work type
On-site
Location
Santa Clara, CA
Salary
$272,000–$431,250 / yr
Posted
8 days ago

Market check

Salary context

Above market

How this pay compares to similar roles

Similar $204k
This role $352k
$133k most similar roles pay here $463k

This role pays more than 99% of similar roles. Most pay $165,000–$242,562 — the shaded band above. At the midpoint, this role pays about $352k versus about $204k for comparable roles.

Based on 240 similar postings.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads. Industry: Semiconductors & AI Computing

Nvidia currently has 994 open roles on FindRole.

Listed pay typically runs $168,000–$270,250 across 977 roles with salary data.

Most-posted roles

View all roles at Nvidia

At a glance

TL;DR · Senior Research Manager, World Model Evaluation

As a Senior Research Manager at NVIDIA’s world model team, you will lead a group of Research Scientists in evaluating and benchmarking multimodal AI, robotics, and world foundation models. Your day-to-day responsibilities include defining a scientific roadmap for both closed-system and open-system evaluations, developing benchmarks for physical plausibility and temporal consistency, and driving evaluation-to-model-improvement loops with training teams. You will also publish high-quality papers and establish rigorous standards for model comparison. Ideal candidates have a strong research background in machine learning, computer vision, or robotics, experience leading research programs, and deep knowledge of modern foundation models. Familiarity with world-model evaluation techniques such as representation probing and causal interventions is essential. This role requires 12+ years of relevant research or engineering experience and 5+ years of management experience, with a focus on Physical AI and embodied systems at scale.

What you'll do

  • Lead a team focused on evaluating and benchmarking NVIDIA’s Physical AI models.
  • Define scientific roadmap for closed-system and open-system evaluations of world models.
  • Develop benchmarks for physical plausibility, temporal consistency, and spatial reasoning.
  • Create mechanistic evaluation methods using model internals for deep diagnostics.
  • Drive improvement loops by integrating evaluation feedback into training processes.
  • Publish high-quality papers on evaluation methodologies and open-source artifacts.

What we're looking for

  • Strong research background in machine learning, computer vision, multimodal AI, robotics, world models, representation learning, model evaluation, or mechanistic interpretability.
  • Experience leading research teams and designing serious benchmarks with measurable scientific impact.
  • Deep understanding of modern foundation models including video models, self-supervised learning, and world-model architectures.
  • Familiarity with advanced world-model evaluation techniques such as physical plausibility, temporal consistency, and causal interventions.
  • Published influential papers or tools in areas like world models, embodied AI, robotics, representation learning, or model evaluation.
  • 12+ years of relevant research or engineering experience including 5+ years of management experience.
  • Ability to work onsite at NVIDIA’s Santa Clara headquarters.

More like this

Similar roles

Sr. Research Manager, Evaluation Science

Apple Inc

Seattle, WA 38 days ago $216,600$325,500
Python Java C++ ML frameworks Cloud services CI/CD NeurIPS ICML ICLR ACL EMNLP SDKs APIs Psychometrics Human-centered design Production engineering Machine Learning Statistics

Senior Manager, Global Implementation

DoorDash, Inc

New York, NY +4 12 days ago $138,720$204,000
Salesforce Gainsight Zendesk JIRA CI/CD Python PostgreSQL AWS Kubernetes Docker Prometheus Grafana

Senior Director, Applied Research

Capital One Financial

San Francisco, CA +4 53 days ago $318,100$363,100
Python PyTorch TensorFlow Kubernetes Docker CI/CD AWS NLP Graph Neural Networks Deep Learning Recommender Systems GPU Clusters Large Language Models Data Quality Tokenization Dataset Curation Gradient Checkpointing Model Compression Model Sparsification Quantization

Senior Manager, R&D

Abbott

St. Paul, MN 77 days ago $129,300$258,700
Python MATLAB SolidWorks ANSYS CAD PLM FDA ISO IEC DOE Minitab Six Sigma Design for Manufacturing (DFM) Design for Assembly (DFA) Risk Management Project Management Lean Manufacturing Agile Methodology CI/CD

Global Head of Policy Research, Head Economist

DoorDash, Inc

Washington, DC +1 12 days ago $214,200$315,000
SQL Python R Tableau Jupyter Notebook GitHub Hypothesis Testing Regression Analysis Machine Learning AWS Google Cloud Platform CI/CD Git Docker Kubernetes

Director, Research Systems

Gilead Sciences

Remote (Foster City, CA) 116 days ago $226,185$292,710
C# ASP.NET MVC SQL Kubernetes Docker CI/CD Terraform AWS Azure PostgreSQL Git Agile Scrum Python Java REST Swagger JSON XML OAuth JWT API Gateway GraphQL
Remote