Senior Solutions Architect - Cluster Design and Architecture

Nvidia

Actively hiring
Santa Clara, CA, US Posted 128 days ago $184,000–$287,500 / year

At a glance

AI generated

TL;DR

As a Senior Solutions Architect on NVIDIA’s Cluster Design and Architecture team, you will play a pivotal role in assisting with designs and architectures for next-generation GPU-based clusters that power advanced AI supercomputers and enterprise infrastructure. Your responsibilities include partnering with internal engineering teams to convey critical architecture information to field teams and customers, guiding them through complex cluster design challenges while ensuring optimal performance and supportability. You will also work on end-to-end cluster deployments, perform hands-on debugging for configuration issues, and provide feedback from the field back to engineering teams. Essential skills include expertise in large-scale distributed systems, AI clusters, HPC infrastructure, and NVIDIA products such as GPUs and NVLink, along with strong customer-facing communication abilities.

Skills

GPU, NVLink, NVIDIA Networking, NCCL, MPI, IMEX, NMX, Distributed training, Cluster design, HPC infrastructure, AI clusters, Debugging, Customer documentation, Multi-functional communications

What you'll do

  • Partner with engineering to convey GPU cluster design information to field teams and customers.
  • Guide field teams and customers on designing optimal GPU clusters under complex constraints.
  • Ensure successful first deployments of new NVIDIA products in customer environments.
  • Provide feedback from the field to internal engineering teams for product improvements.
  • Assist field teams in debugging issues related to cluster configuration and performance.
  • Support new product introduction (NPI) customer deployments with new GPU and networking architectures.

What we're looking for

  • BS, MS, or PhD in a relevant technical field or equivalent experience
  • 8+ years of cluster design, validation, and issue resolution on GPU/HPC clusters
  • Proven expertise in designing large-scale distributed systems, AI clusters, or HPC infrastructure
  • Ability to translate complex engineering concepts into customer documentation and reference material
  • Experience leading the bring-up and build of large-scale AI Factory or HPC clusters
  • Hands-on experience with NVIDIA GPUs, NVLink, Networking products, and debugging issues during deployment
  • Knowledge of NCCL, MPI, IMEX, NMX, and collectives in distributed training for cluster designs

Market check

Salary context

This $184,000–$287,500 range sits above 85% of similar postings on FindRole.

Peer median band

$153,500–$250,800

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$162,000–$235,750

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads. Industry: Semiconductors & AI Computing

Nvidia currently has 802 open roles on FindRole.

Listed pay typically runs $184,000–$287,500 across 798 roles with salary data.

More like this

Similar roles

Senior Solutions Architect - Cluster Design and Architecture

Nvidia

Santa Clara, CA, US 125 days ago $184,000–$287,500
GPU, NVLink, NVIDIA Networking, NCCL, MPI, IMEX, NMX, Distributed training, Cluster design, HPC infrastructure, AI clusters, CI/CD, Debugging, Performance modeling, Terraform, AWS, Kubernetes, PostgreSQL

Senior Solutions Architect - Data Center Infrastructure

Nvidia

Remote (Santa Clara, CA, US) 17 days ago $184,000–$287,500
NVIDIA GPU, Networking, Deep Learning, AI Infrastructure, Hyperscaler, Cloud Service Provider, CSP, Linux, NCCL, DCGM, UFM, APIs, Embedded Linux Systems, Hardware Demos, System Designs, Technical Training, Sales Training, Customer Support, Problem Solving, Data Analysis, Logs Analysis
Remote

Senior Solutions Architect - Data Center Infrastructure

Nvidia

Remote (Santa Clara, CA, US) 17 days ago $152,000–$241,500
NVIDIA GPU, Networking, Deep Learning, Inference, System Design, Cloud Services, Hyperscaler, CSP, OEM, AI Market, Technical Support, Hardware Demos, Software Libraries, NCCL, DCGM, UFM, Embedded Linux Systems, APIs
Remote

Senior Solution Architect

Sony Group Corporation

Na / Culver City Corporate Pointe 40, US 27 days ago $158,808–$160,000
Snowflake, Python, Airflow, GitHub, CI/CD, Kubernetes, Terraform, AWS, Azure, Google Cloud Platform, Docker, PostgreSQL, Redis, MongoDB, GitLab, Jenkins, Ansible, Prometheus, Grafana