Solutions Architect, Inference Deployments
Nvidia
At a glance
AI generated
As a Solutions Architect focused on inference, you will work closely with engineering and DevOps teams to build enterprise-grade AI solutions on NVIDIA GPU technology and Kubernetes. Day to day, you will build efficient inference pipelines with tools such as NVIDIA Dynamo, orchestrate disaggregated inference on Kubernetes for complex workloads, and accelerate those pipelines with TensorRT-LLM and other backends. You will also mentor customers and internal teams on deploying disaggregated inference systems and resolving intricate technical issues. Ideal candidates have more than five years of experience in solutions architecture, a strong background in deploying distributed systems on Kubernetes, and expertise with NVIDIA Dynamo, Triton Inference Server, and TensorRT-LLM for model optimization. Proficiency in GPU orchestration, including operators such as NIM and MIG partitioning, and deep knowledge of transformer neural networks and inference acceleration technologies are also essential.
Market check
This $152,000–$241,500 range sits above 57% of similar postings on FindRole.
Peer median band
$152,000–$241,500
Median floor and ceiling across peers.
Typical midpoint (25–75%)
$161,965–$235,750
Middle half of comparable postings.
Based on 240 comparable postings.
* 240 is the maximum number of comparable postings sampled.
Employer
Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads.
Industry: Semiconductors & AI Computing
Nvidia currently has 802 open roles on FindRole.
Listed pay typically runs $184,000–$287,500 across 798 roles with salary data.