Principal Product Manager, AI Frameworks

Nvidia

Actively hiring
Us, Ca, Santa Clara, US Posted 122 days ago $240,000$379,500 / year

At a glance

AI generated

TL;DR

As a Senior Product Manager for AI Platform Post-Training and RL at NVIDIA, you will join a dynamic team focused on enabling researchers and operators to achieve success on the NVIDIA platform. Your primary responsibilities include building tools, SDKs, and libraries that enhance large-scale performance and resilience of models on GPUs. You will collaborate with internal teams and external customers to develop product roadmaps and go-to-market strategies while staying abreast of advancements in post-training software. The role requires expertise in designing and scaling training/post-training systems, knowledge of distributed computing, and experience with frameworks like VeRL, Tunix, PyTorch distributed, and torchtitan. Ideal candidates possess a strong background in machine learning concepts, GPU architecture, and performance profiling, along with extensive product management experience at leading tech companies.

Skills

Python PyTorch VeRL Tunix Nemo Framework Docker Kubernetes CI/CD GitHub PostgreSQL Prometheus Grafana AWS Azure Google Cloud Platform GitLab Jenkins Terraform OpenStack Linux CUDA C++

What you'll do

  • Create and optimize post-training RL libraries to enhance model builders' performance on NVIDIA GPUs.
  • Develop product strategy and roadmaps focusing on training/post-training software improvements.
  • Collaborate with customers to build product-based roadmaps for AI platform tools and SDKs.
  • Work with leadership to align product initiatives with company-wide strategic goals.
  • Lead the development of go-to-market plans in conjunction with marketing teams.

What we're looking for

  • Experience in designing and scaling training/post-training software and optimization tools.
  • Demonstrable knowledge of machine learning concepts, including model training and performance optimization.
  • Proven experience with large-scale distributed systems.
  • BS or MS degree in Computer Science, Engineering, or equivalent technical background.
  • 15+ years of technical product management experience at a technology company.
  • Experience leading reinforcement learning projects from research to production at scale.
  • Knowledge of GPU architecture, hardware/software co-design, and performance profiling.

Market check

Salary context

This $240,000–$379,500 range sits above 95% of similar postings on FindRole.

Peer median band

$177,560$262,400

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$185,500$246,150

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads. Industry: Semiconductors & AI Computing

Nvidia currently has 802 open roles on FindRole.

Listed pay typically runs $184,000–$287,500 across 798 roles with salary data.

Most-posted roles

View all roles at Nvidia

More like this

Similar roles

Principal Product Manager, AI Frameworks

Nvidia

Us, Ca, Santa Clara, US 139 days ago $240,000$379,500
PyTorch Distributed Systems GPU Architecture Performance Profiling CI/CD GitHub OpenSource Python PostgreSQL Kubernetes AWS NVIDIA GPUs VeRL Nemo Framework Terraform Prometheus Grafana

Senior Product Manager, AI Frameworks

Nvidia

Us, Ca, Santa Clara, US 46 days ago $168,000$258,750
Python C++ CUDA TensorFlow PyTorch FSDP GitHub Docker Kubernetes CI/CD Prometheus Grafana PostgreSQL AWS Azure GitLab OpenStack NVIDIA_GPU_Architecture HW/SW_Co_Design Performance_Profiling

Principal Applied AI Product Manager

Adobe

San Jose, US 11 days ago $194,800$282,100
SQL LLM Agentic architectures API design Figma FigJam Miro Jira Confluence Git CI/CD Python PostgreSQL AWS Kubernetes

Product Manager, AI Infrastructure

Arm Holdings

San Jose, California, US 33 days ago $211,600$286,200
AI ML Datacenter Cloud Product Management Systems Thinking Competitive Analysis GenAI ML Accelerators Hardware Software Infrastructure Performance Analysis System Layers End-to-End System Behavior Problem Solving Technical Analysis Customer Workloads Product Roadmap Engineering Priorities

Principal Technical Product Manager - Agentic AI

Intuit

San Diego, California, US 41 days ago $243,000$328,500
LLM Agentic Systems Workflow Orchestration API Contracts State Management CI/CD Observability SDKs Docker Kubernetes Python PostgreSQL AWS GCP Azure Prometheus Grafana