Senior Technical Program Manager, Cloud Infrastructure

Nvidia

Hybrid · Actively hiring
Santa Clara, US · Seattle, US · Posted 21 days ago · $200,000–$322,000 / year

At a glance

AI generated

TL;DR

NVIDIA’s DGX Cloud team seeks an experienced Technical Program Manager to drive critical programs related to AI capacity enablement and management. This TPM will partner with internal engineering teams to build foundational capabilities and processes for global AI infrastructure, focusing on cluster bring-up and maintenance. Key responsibilities include gathering technical requirements, developing roadmaps, leveraging Jira for program management, collaborating cross-functionally to onboard third-party solutions, establishing KPIs, mitigating risks, and executing communication strategies to ensure visibility with executive leadership. The ideal candidate has over 10 years of experience in technical program management within a matrixed organization, extensive hands-on cloud infrastructure expertise, proficiency with Jira and other PM tools, and deep knowledge of NVIDIA GPU products and cloud technologies like Kubernetes and Terraform.

Skills

Jira, Kubernetes, Terraform, API integration, Python, CI/CD, Prometheus, Grafana, NVIDIA GPU products, AWS, Azure, Google Cloud Platform, PostgreSQL, Docker, Git, Scrum, Agile methodologies

What you'll do

  • Gather technical requirements and develop comprehensive roadmaps for AI capacity enablement.
  • Drive the onboarding of third-party and in-house solutions for DGX Cloud deployments.
  • Establish metrics and KPIs to quantitatively demonstrate program value and impact.
  • Proactively identify, resolve, and mitigate risks affecting scope, schedule, and quality.
  • Develop and execute a robust communication strategy for program progress visibility.
  • Encourage continuous improvement within cloud infrastructure operations.

What we're looking for

  • 10+ years of technical program management experience in large-scale engineering programs
  • Extensive hands-on experience with cloud infrastructure, preferably from a major Cloud Service Provider (CSP)
  • Expert proficiency with Jira and other program management tools in an Agile/Scrum framework
  • Strong strategic and tactical thinking abilities to build consensus and drive program success
  • Excellent communication skills for executive audiences and multi-functional teams
  • BS or MS in Electrical Engineering, Computer Science, or equivalent experience
  • In-depth knowledge of NVIDIA GPU products and cloud technologies like Kubernetes and Terraform

Market check

Salary context

This $200,000–$322,000 range sits above 91% of similar postings on FindRole.

Peer median band

$145,240–$233,100

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$150,090–$235,156

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads. Industry: Semiconductors & AI Computing

Nvidia currently has 801 open roles on FindRole.

Listed pay typically runs $184,000–$287,500 across 797 roles with salary data.

More like this

Similar roles

Senior Technical Program Manager, Cloud Infrastructure

Nvidia

Santa Clara, CA, US · 28 days ago · $168,000–$258,750
Jira, Kubernetes, Terraform, API integration, CI/CD, NVIDIA GPU products, Cloud Service Providers, PostgreSQL, Python, Docker, AWS, Azure, Grafana, Prometheus, Scrum, DevOps

Technical Program Manager, Cloud Infrastructure

Nvidia

Santa Clara, CA, US · 21 days ago · $168,000–$258,750
Jira, Kubernetes, Terraform, API integration, CI/CD, AWS, Azure, GCP, PostgreSQL, Docker, Prometheus, Grafana, GitLab, Python, NVIDIA GPUs, Cloud-native environments, AI infrastructure, ML infrastructure

Senior Technical Program Manager, DGX Cloud Software Products and Services

Nvidia

Santa Clara, CA, US · 25 days ago · $168,000–$258,750
Jira, Aha!, Confluence, Git, Distributed version control systems, Reliability engineering, Resilience development, Service performance metrics, Goodput, Efficiency, Utilization, Distributed training frameworks, Checkpointing, NCCL, Slurm, AI infrastructure, Large-scale compute platforms, CI/CD

Senior Technical Program Manager, Software Compute Platform

Nvidia

Santa Clara, CA, US · 49 days ago · $200,000–$322,000
Python, Java, C++, Git, Jenkins, Black Duck, Palamida, Docker, Kubernetes, AWS, CI/CD, PostgreSQL, Linux, OSS profiling, Version control, Release management, Test plans, Automation scripts