Principal Product Manager
Circle
At a glance
AI generated
As the Product Manager for resilient automation at NVIDIA’s AI Factory, you will lead the strategic direction and roadmap of the break-fix automation system used in DGX Cloud. Your responsibilities include defining automation thresholds, integrating failure attribution with automated repair actions, and building a user-friendly operator experience that enhances workflow transparency and audit trails. You will collaborate closely with NCP operators, SRE teams, and hardware vendor partners to optimize repair workflows at scale, ensuring high reliability and operational safety. This role requires 15+ years of product management experience in infrastructure or MLOps, expertise in distributed systems and workflow orchestration, and a strong background in GPU infrastructure and datacenter operations. You will work with cutting-edge technologies to ensure AI factories can self-heal efficiently at scale.
Market check
This $240,000–$379,500 range sits above 96% of similar postings on FindRole.
Peer median band
$163,000–$229,100
Median floor and ceiling across peers.
Typical midpoint (25–75%)
$162,000–$224,150
Middle half of comparable postings.
Based on 239 comparable postings.
* 240 is the maximum number of comparable postings sampled.
Employer
Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads.
Industry: Semiconductors & AI Computing
Nvidia currently has 801 open roles on FindRole.
Listed pay typically runs $184,000–$287,500 across 797 roles with salary data.
More like this
Circle
Adobe
Circle
Genentech
Broadcom
Q2