Solutions Architect, LLM Model Builder

Nvidia

Actively hiring
Santa Clara, CA Posted 57 days ago $152,000–$241,500 / year

At a glance

AI generated

TL;DR

NVIDIA is hiring a Solutions Architect, Foundation Models to join its Partner Solutions Architecture team as a strategic technical expert and hands-on advisor. The role involves guiding partners in building, fine-tuning, optimizing, and deploying foundation model solutions for customer workloads across reasoning, multimodal models, and production inference. Key responsibilities include defining benchmark plans, synthetic data workflows, and repeatable validation recipes; advising on compute planning; and developing reference architectures using CUDA, NeMo, Nemotron, Dynamo, TensorRT-LLM, Triton, and related tools. The ideal candidate holds an MSc or PhD in a relevant field, has extensive experience with large-scale inference systems, and has strong programming skills in Python along with PyTorch, JAX, or TensorFlow. Familiarity with GPU infrastructure and active open-source contributions are also highly valued.

Skills

Python, PyTorch, JAX, TensorFlow, NVIDIA NeMo, Nemotron, Dynamo, TensorRT-LLM, Triton, vLLM, CUDA, NVLink, InfiniBand, MPI, NCCL, CI/CD, Prometheus, Grafana

What you'll do

  • Serve as the lead technical advisor for partners on reasoning and multimodal models.
  • Guide partners to optimize customer workloads through fine-tuning and benchmarking.
  • Define benchmark plans and validation recipes for repeatable testing processes.
  • Advise on compute planning including GPU selection, network configuration, and storage.
  • Develop reference architectures and sizing models for production-readiness testing.

What we're looking for

  • MSc or PhD in Computer Science, Electrical Engineering, or a related field, with 5+ years of experience in LLMs and large-scale inference systems.
  • Hands-on expertise in fine-tuning, benchmarking, evaluation, optimization, and production deployment of foundation models.
  • Strong programming skills in Python and proficiency with PyTorch, JAX, TensorFlow, Nemotron, NeMo, Dynamo, TensorRT-LLM, Triton, and vLLM.
  • Experience helping partners or customers deploy large-scale AI systems in production environments.
  • Familiarity with GPU infrastructure including NVLink, InfiniBand, MPI, NCCL, and cluster technologies.
  • Active contributions to open-source software projects related to model tooling, inference, evaluation, or performance optimization.

Market check

Salary context

Competitive pay

How this pay compares to similar roles

Similar roles: $215k
This role: $197k
Chart range: $139k–$271k; most similar roles pay within the shaded band.

This role pays less than 59% of similar roles. Most pay $184,325–$246,150 — the shaded band above. At the midpoint, this role pays about $197k versus about $215k for comparable roles.

Based on 240 similar postings.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads.

Industry: Semiconductors & AI Computing

Nvidia currently has 824 open roles on FindRole.

Listed pay typically runs $184,000–$287,500 across 812 roles with salary data.

More like this

Similar roles

Solutions Architect

Booz Allen Hamilton

McLean, Virginia 68 days ago $99,000–$225,000
AWS, CI/CD, Kubernetes, Docker, Terraform, Python, PostgreSQL, Git, Jenkins, Ansible, Prometheus, Grafana

Solutions Architect

Booz Allen Hamilton

McLean, Virginia 33 days ago $112,800–$257,000
AWS, Azure, GCP, microservices, CI/CD, Secret clearance required, experience, network diagrams, technical specifications, system installation procedures, digital transformation, modern system architecture principles, SaaS, PaaS, IaaS

Solutions Architect

Booz Allen Hamilton

Chantilly, VA 13 days ago $112,800–$257,000
AWS, Azure, GCP, microservices, CI/CD, Python, Docker, Kubernetes, Terraform, PostgreSQL, AI, full-stack development, SaaS, PaaS, IaaS

Solutions Architect

Equifax

Alpharetta, GA 6 days ago
GCP, AWS, Python, Java, JavaScript, Bash, Go, GitHub, Docker, Kubernetes, Terraform, CI/CD, Prometheus, PostgreSQL, RDS, Event-Driven Architecture, CQRS, Microservices, DevSecOps
Hybrid

Solution Architect

The Hartford

Hartford, CT 84 days ago $131,600–$197,400
AWS, GCP, Java, Python, .NET, Angular, React, Microservices, APIs, SaaS, CI/CD, Cloud-Native, Kubernetes, Docker, Terraform, REST, Event-Driven, Agile, SAFe, Micro Frontends, DevOps
Hybrid