Senior Solutions Architect, GPU Performance and LLM - Cloud Service Providers

Nvidia

Actively hiring
Santa Clara, US · Seattle, US · Posted 21 days ago · $184,000–$287,500 / year

At a glance

AI generated

TL;DR

As a Solutions Architect at NVIDIA, you will join a dedicated team that helps large enterprises develop and deploy AI and HPC solutions at massive scale. Day-to-day responsibilities include collaborating with Sales Account Managers and Developer Relations Managers to identify business opportunities, serving as the primary technical liaison for customers working on complex AI infrastructure projects, and holding regular meetings to provide performance advice and debugging support. You will also build Proofs of Concept to address critical business needs and integrate NVIDIA technology into cloud services. The ideal candidate has more than 8 years of experience in engineering roles focused on system architecture and performance tuning, hands-on experience with deep learning frameworks such as PyTorch and JAX, and familiarity with NVIDIA’s hardware and software ecosystem, including libraries such as TensorRT and RAPIDS. You should also be proficient in deploying solutions in cloud environments using DevOps tools such as Docker and Kubernetes.

Skills

PyTorch JAX TensorRT NeMo NCCL RAPIDS AWS GCP Azure OCI Docker Kubernetes DevOps MLOps CI/CD Prometheus Grafana PostgreSQL Python

What you'll do

  • Develop and demonstrate AI/ML and HPC software solutions for tech giants using NVIDIA’s technology.
  • Identify business opportunities by partnering with Sales Account Managers and Developer Relations Managers.
  • Serve as the primary technical contact for customers developing complex AI infrastructure projects.
  • Conduct regular technical meetings to provide performance advice, debugging assistance, and new technology introductions.
  • Build Proofs of Concept (PoCs) to address critical customer needs and integrate NVIDIA technology on hyperscaler platforms.
  • Analyze and resolve customer performance issues related to both AI and systems performance.

What we're looking for

  • 8+ years of engineering experience with performance/system/solution focus.
  • Expertise in building benchmarks and optimizing large-scale AI training and inference systems.
  • Deep understanding of AI accelerators, networking, and overall system architecture impact on application performance.
  • Strong program management skills for balancing multiple tasks effectively.
  • Proficiency with deep learning frameworks, compilers, and NVIDIA libraries including TensorRT-LLM, TensorRT, NeMo, NCCL, and RAPIDS.
  • Experience deploying solutions in cloud environments such as AWS, GCP, Azure, and OCI using DevOps/MLOps technologies.

Market check

Salary context

This $184,000–$287,500 range sits above 85% of similar postings on FindRole.

Peer median band

$156,475–$241,500

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$160,837–$235,750

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads.

Industry: Semiconductors & AI Computing

Nvidia currently has 801 open roles on FindRole.

Listed pay typically runs $184,000–$287,500 across 797 roles with salary data.

More like this

Similar roles

Senior Solutions Architect, NVIDIA Cloud Partners

Nvidia

Santa Clara, CA, US · 44 days ago · $184,000–$287,500
NVIDIA GPU GenerativeAI LLMs NCCL DCGM UFM MissionControl BaseCommandManager Kubernetes Slurm CI/CD Python PostgreSQL AWS Azure Grafana Prometheus Docker Terraform

Senior Solutions Architect, NVIDIA Cloud Partners

Nvidia

Remote (Santa Clara, CA, US) · 44 days ago · $184,000–$287,500
NVIDIA AWS Azure GCP Python PyTorch TensorFlow NVIDIA_Nemotron NVIDIA_NeMo_Framework NVIDIA_Dynamo NVIDIA_NeMo_Retriever NVIDIA_Triton_Inference_Server TensorRT TensorRT-LLM CUDA-X NCCL DCGM UFM Mission_Control Base_Command_Manager SLURM K8s MLOps

Senior Software Architect, GPU Networking

Nvidia

Santa Clara, CA, US · 140 days ago · $184,000–$287,500
Kubernetes SDN InfiniBand Python Go Rust C++ Docker CI/CD Prometheus Grafana AWS Azure Google Cloud Platform PostgreSQL MySQL Linux Networking Operating Systems Virtualization Storage AI Deep Learning

Senior Architect, GPU Profiling System

Nvidia

Remote (Santa Clara, CA, US) · 81 days ago · $184,000–$287,500
C++ Python SystemC GPU AI HPC CI/CD Git Linux CUDA OpenCL NVIDIA_Nsight Perforce JIRA Confluence Docker Kubernetes AWS Google_Cloud_Platform Azure PostgreSQL MongoDB

Senior Staff Engineer, GPU Software Architecture

Samsung Electronics

Remote (3900 N Capital of Texas Hwy, Austin, TX, US) · 85 days ago · $180,200–$297,200
C C++ Python Vulkan DirectX Metal HLSL GLSL OpenCL CUDA Unreal Unity Linux Android OpenGL 3D graphics GPU hardware ray tracing rasterization linear algebra multi-threaded debugging performance profiling parallel programming game engines offline compiler JIT compiler

Principal Solutions Architect - GPU Cloud Network Infrastructure

Nvidia

Remote (Santa Clara, CA, US) · 51 days ago · $272,000–$431,250
TCP/IP BGP DNS HTTP/2 QUIC High-performance networking Cloud networking services Multi-region architectures IP transit Internet peering technologies Data center networking Internet routing Traffic shaping Edge computing concepts CDN SRE CI/CD Kubernetes Terraform AWS Azure GCP