Senior Solutions Architect, Generative AI Deployment and AIOps

Nvidia

Actively hiring
Remote (Us, Ca, Santa Clara, US) Posted 52 days ago $184,000$287,500 / year

At a glance

AI generated

TL;DR

NVIDIA seeks an experienced AI Solutions Architect to join its dynamic team as a senior-level technical advisor, assisting customers in building innovative solutions using the latest Accelerated Computing and Deep Learning technologies. This role involves collaborating closely with internal teams on performance analysis and modeling of inference software for Generative AI and Large Language Models (LLMs), while also engaging with developers, researchers, and data scientists to define high-value solutions. The ideal candidate will have 8+ years of hands-on experience with deep learning frameworks like PyTorch and TensorFlow, strong Python programming skills, and proficiency in GPU orchestration within Kubernetes environments. Additionally, familiarity with NVIDIA GPUs and software libraries such as TensorRT and TensorRT-LLM is essential for optimizing AI inference workloads on Kubernetes clusters.

Skills

Python PyTorch TensorFlow Kubernetes MIG Docker Prometheus Grafana NVIDIA GPUs NVIDIA NIM Dynamo TensorRT TensorRT-LLM C++ CUDA MPI

What you'll do

  • Define high-value solutions by understanding strategies and technical needs of internal teams.
  • Engage dynamically with developers, researchers, and data scientists to gain experience across technical areas.
  • Partner strategically with key customers and solution partners for NVIDIA’s computing platform.
  • Help customers adopt and build creative solutions using NVIDIA technology and MLOps practices.
  • Analyze performance and power efficiency of AI inference workloads on Kubernetes environments.

What we're looking for

  • 8+ years of hands-on experience with Deep Learning frameworks like PyTorch and TensorFlow.
  • Strong proficiency in Python for programming, optimizations, and software design.
  • Expertise in GPU orchestration, Multi-Instance GPU management within Kubernetes environments.
  • Experience deploying or optimizing DL inference at scale in production settings.
  • Proficiency with NVIDIA GPUs and software libraries including NIM, Dynamo, TensorRT, TensorRT-LLM.
  • Excellent C/C++ programming skills for debugging, profiling, code optimization, performance analysis.

Market check

Salary context

This $184,000–$287,500 range sits above 74% of similar postings on FindRole.

Peer median band

$170,000$254,800

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$167,150$244,400

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads. Industry: Semiconductors & AI Computing

Nvidia currently has 802 open roles on FindRole.

Listed pay typically runs $184,000–$287,500 across 798 roles with salary data.

Most-posted roles

View all roles at Nvidia

More like this

Similar roles

Senior Solutions Architect, Generative AI

Nvidia

Remote (Us, Ca, Santa Clara, US) 43 days ago $184,000$287,500
Python C++ PyTorch JAX CUDA CUTLASS cuDNN NCCL Kubernetes MLOps GitHub Prometheus Grafana CI/CD
Remote

Senior Solutions Architect, Generative AI

Nvidia

Remote (Us, Ca, Santa Clara, US) 31 days ago $184,000$287,500
Python C++ PyTorch JAX CUDA CUTLASS cuDNN NCCL Kubernetes MLOps GitHub Prometheus Grafana CI/CD
Remote

Senior Solutions Architect, AI Hyperscalers

Nvidia

Remote (Us, Ca, Santa Clara, US) 15 days ago $184,000$287,500
Python CUDA PyTorch JAX Linux Docker Kubernetes HPC GPU Distributed Training Inference Optimization Vector Databases RAG Pipelines Multi-node Clusters Deep Learning Frameworks
Remote

Senior Solutions Architect, Generative AI Specialist

Nvidia

Us, Ca, Santa Clara, US 43 days ago $184,000$287,500
NVIDIA_AIMMO LangChain Haystack MLOps CI/CD LoRA QLoRA DoRA Kubernetes Docker Prometheus Grafana AWS Azure GitHub Python PostgreSQL Redis MongoDB MCP_protocol

Senior AI Solutions Architect

Nvidia

Remote (Us, Ca, Santa Clara, US) 24 days ago $152,000$241,500
Python C/C++ PyTorch Tensorflow Kubernetes GitHub NVIDIA CUDA Docker PCIe GPU FPGA DSP OpenCL HDL CI/CD
Remote