Senior Manager, DGX Cloud Technical Program Management

Nvidia

Hybrid Actively hiring
Santa Clara, US · Seattle, US Posted 25 days ago $240,000$379,500 / year

At a glance

AI generated

TL;DR

As a Technical Program Management Manager at NVIDIA’s DGX Cloud organization, you will lead a team responsible for core infrastructure programs including network, storage, trust services, security, telemetry, and break/fix operations. Your role involves driving cross-functional alignment, managing priorities, dependencies, risks, and delivery plans across engineering teams, and building clear operating rhythms to ensure resilience and scalability of DGX Cloud. You will need over 12 years of experience in technical program management with a focus on infrastructure programs, strong communication skills, and expertise in cloud infrastructure, distributed systems, observability tools like Grafana and Prometheus, and security compliance. This role is pivotal in supporting NVIDIA’s mission to advance AI innovation through robust and reliable infrastructure solutions.

Skills

Grafana Prometheus Kubernetes AWS Azure CI/CD Docker Python PostgreSQL Terraform GitLab Jenkins Ansible NVIDIA GPU AI/ML platforms observability telemetry cloud infrastructure distributed systems security compliance

What you'll do

  • Lead a team of Technical Program Managers for DGX Cloud core infrastructure projects.
  • Define priorities, achievements, dependencies, and delivery plans across multiple workstreams.
  • Build operating rhythms for infrastructure planning and cross-functional decision-making.
  • Improve access to infrastructure health through metrics, dashboards, and reporting tools.
  • Coordinate break/fix and operational readiness programs to enhance reliability and response.

What we're looking for

  • Over 12 years in technical program management, including at least 3 years supervising TPMs.
  • Extensive experience managing cloud infrastructure programs involving networking, storage, security, and observability.
  • Proven ability to manage priorities, risks, and execution plans across multiple engineering teams.
  • Strong track record building TPM operating rhythms and improving operational processes for break/fix operations.
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or related field required.
  • Experience with AI/ML platforms, GPU clusters, observability tools like Grafana and Prometheus.

Market check

Salary context

This $240,000–$379,500 range sits above 96% of similar postings on FindRole.

Peer median band

$154,400$238,500

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$165,000$235,750

Middle half of comparable postings.

Based on 239 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads. Industry: Semiconductors & AI Computing

Nvidia currently has 801 open roles on FindRole.

Listed pay typically runs $184,000–$287,500 across 797 roles with salary data.

Most-posted roles

View all roles at Nvidia

More like this

Similar roles

Senior Technical Program Manager, DGX Cloud Software Products and Services

Nvidia

Us, Ca, Santa Clara, US 25 days ago $168,000$258,750
Jira Aha! Confluence Git Distributed version control systems Reliability engineering Resilience development Service performance metrics Goodput Efficiency Utilization Distributed training frameworks Checkpointing NCCL Slurm AI infrastructure Large-scale compute platforms CI/CD

Senior Technical Program Manager, DGX Cloud - Trust Services

Nvidia

Us, Ca, Santa Clara, US 25 days ago $200,000$322,000
Jira Confluence CI/CD GPU Firmware Security Confidential Computing Device Trust Hardware/Software Trust Models Cloud Platforms Automation Telemetry Dashboards PostgreSQL Kubernetes AWS Azure Grafana Prometheus

Senior Technical Program Manager, Cloud Infrastructure

Nvidia

Us, Ca, Santa Clara, US 21 days ago $200,000$322,000
Jira Kubernetes Terraform API integration Python CI/CD Prometheus Grafana NVIDIA GPU products AWS Azure Google Cloud Platform PostgreSQL Docker Git Scrum Agile methodologies

Senior Technical Program Manager, Cloud Infrastructure

Nvidia

Us, Ca, Santa Clara, US 28 days ago $168,000$258,750
Jira Kubernetes Terraform API integration CI/CD NVIDIA GPU products Cloud Service Providers PostgreSQL Python Docker AWS Azure Grafana Prometheus Scrum DevOps