Solutions Architect - Rack Scale AI Systems

Nvidia

Actively hiring
Santa Clara, CA · Austin, TX · Hillsboro, OR Posted 126 days ago $208,000$333,500 / year

At a glance

AI generated

TL;DR

NVIDIA’s Cloud Infrastructure Team within IPP is seeking a Solutions Architect to collaborate with various product teams on deploying and optimizing Rack Scale AI Products in data centers and labs. This role involves designing sophisticated cloud services, rolling out new development features for NVIDIA hardware, and integrating cluster deployment methods into the cloud. The ideal candidate will have over 12 years of experience, including extensive Linux and scripting expertise, a strong background in OS kernels and system engineering, and proficiency in embedded systems, orchestration, automation, data centers, and cloud architecture. They should also possess excellent communication skills and be adept at problem-solving in complex multi-site deployments. The position requires understanding dense data center design and large-scale QA environments, with experience in GPU clusters and high-speed interconnects like InfiniBand.

Skills

Linux Python Kubernetes Terraform AWS GCP Azure Docker CI/CD Prometheus Grafana PostgreSQL Ansible Git InfiniBand MPI NVIDIA GPUs Tegra Processors

What you'll do

  • Work with NVIDIA Product Teams to understand new product roadmaps and requirements.
  • Design optimal solutions for deploying products in data centers or lab environments.
  • Assist in the roll-out of new development features supporting latest NVIDIA hardware.
  • Define full-scale solutions for onboarding products into hosted and private cloud environments.
  • Solve complex problems involving multi-site deployments of NVIDIA products.
  • Collaborate with cross-functional teams to deliver a reliable platform from concept to deployment.
  • Integrate and optimize cluster deployment methods, managing software stack deployments.

What we're looking for

  • 12+ years of relevant experience in cloud infrastructure or related field
  • Solid background in Linux and scripting with 6+ years of hands-on experience
  • Strong technical skills in embedded systems, orchestration, automation, data centers, and cloud architecture
  • Experience in deploying and optimizing GPU and compute clusters in large-scale environments
  • Understanding of dense data center design including compute, storage, and networking
  • Track record of quickly understanding new technologies and deploying complex systems in fast-paced environments
  • Strong problem-solving skills with experience in product engineering, failure analysis, and debug/hardware design

Market check

Salary context

Above market

How this pay compares to similar roles

Similar $205k
This role $271k
$144k most similar roles pay here $354k

This role pays more than 86% of similar roles. Most pay $164,373–$246,150 — the shaded band above. At the midpoint, this role pays about $271k versus about $205k for comparable roles.

Based on 240 similar postings.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads. Industry: Semiconductors & AI Computing

Nvidia currently has 824 open roles on FindRole.

Listed pay typically runs $184,000–$287,500 across 812 roles with salary data.

Most-posted roles

View all roles at Nvidia

More like this

Similar roles

Solutions Architect, AI and ML

Nvidia

Redmond, WA 86 days ago $124,000$195,500
AWS GCP Azure TensorFlow PyTorch CUDA RAPIDS Kubernetes Docker Python DevOps CI/CD NVIDIA GPUs GPU-based systems Deep Learning Parallel programming Distributed computing platforms

Solutions Architect, AI and ML

Nvidia

Redmond, WA 91 days ago $124,000$195,500
AWS GCP Azure TensorFlow PyTorch CUDA RAPIDS Kubernetes Docker Python DevOps CI/CD NVIDIA GPUs GPU-based systems Deep Learning Parallel programming Distributed computing platforms

AI Solution Architect

Booz Allen Hamilton

Nellis Afb, NV 20 days ago $112,800$257,000
Palantir Foundry Palantir Gotham Kubernetes DevSecOps CI/CD Docker LLM AI/ML DevOps Secret clearance Top Secret clearance AWS

Solutions Architect, AI Models

Nvidia

Remote (Santa Clara, CA) 43 days ago $152,000$241,500
Python PyTorch TensorFlow Hugging Face Transformers Kubernetes SLURM Docker CI/CD Prometheus Grafana PostgreSQL Git Jupyter Notebook NVIDIA NeMo NVIDIA Nemotron Linux AWS Azure Google Cloud Platform
Remote