Senior Performance Architect, Nemotron

Nvidia

Santa Clara, CA · Hillsboro, OR · Redmond, WA · Posted 10 days ago · $152,000–$241,500 / year

At a glance

AI generated

TL;DR

Join us as a Senior Performance Architect at NVIDIA, where you will play a pivotal role in shaping the future of AI systems through deep model–system–hardware co-design. You will develop high-fidelity performance models to evaluate emerging algorithmic techniques and hardware optimizations for Nemotron family models, ensuring Pareto-optimal trade-offs across accuracy, throughput, and interactivity on target platforms. Your work will involve prioritizing features based on detailed modeling insights and collaborating with diverse teams including DL researchers, hardware architects, and software engineers to guide decisions that enhance the efficiency of AI in production environments. This role requires expertise in computer architecture, roofline modeling, queuing theory, and proficiency in Python for simulator design and data analysis, alongside experience with deep learning frameworks like PyTorch and TRT-LLM.

Skills

Python PyTorch TRT-LLM vLLM SGLang CUDA C++ Roofline modeling Queuing theory Statistical performance analysis Deep learning frameworks GPU computing System evaluation of AI/ML workloads

What you'll do

  • Develop high-fidelity performance models to evaluate emerging AI techniques and hardware optimizations.
  • Prioritize features based on detailed performance modeling to guide future software and hardware roadmaps.
  • Model end-to-end performance impacts of new GenAI workflows to predict datacenter needs.
  • Keep abreast of the latest DL research and collaborate with diverse teams for co-design decisions.
  • Define metrics, design experiments, and visualize large datasets to identify resource bottlenecks in AI workloads.

What we're looking for

  • Master's degree or equivalent experience in Computer Science or related fields.
  • Expertise in computer architecture, roofline modeling, queuing theory, and statistical performance analysis.
  • Strong background in ML fundamentals, model parallelism, and inference serving techniques.
  • Proficiency in Python and C++ for simulator design and data analysis.
  • 3+ years of experience in system evaluation of AI/ML workloads or performance optimization.
  • Experience with deep learning frameworks and inference stacks such as PyTorch, TRT-LLM, vLLM, and SGLang.
  • Comfortable defining metrics, designing experiments, and visualizing large datasets to identify bottlenecks.

Market check

Salary context

This $152,000–$241,500 range sits above 46% of similar postings on FindRole.

Peer median band

$165,750–$257,000

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$165,852–$241,372

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads.

Industry: Semiconductors & AI Computing

Nvidia currently has 801 open roles on FindRole.

Listed pay typically runs $184,000–$287,500 across 797 roles with salary data.


More like this

Similar roles

Senior Solutions Architect, Robotics Infrastructure

Nvidia

Remote (Santa Clara, CA, US) · 122 days ago · $152,000–$241,500
Kubernetes NVIDIA AWS Azure GCP MIG TensorRT-LLM vLLM SGLang Docker CI/CD GitOps IaC Observability ROS2 Isaac Lab Isaac Sim Cosmos Ray PostgreSQL Python Terraform NVIDIA GPU Operator REST gRPC

Senior Technical Program Manager - Nemotron

Nvidia

Santa Clara, CA, US · 18 days ago · $168,000–$258,750
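Python SQL Data quality management Data compliance Agile project management CI/CD Kubernetes AWS GCP Data annotation platforms Synthetic data generation Large-scale data processing Machine learning model optimization Data science practices Data as code Human-in-the-loop annotation workflow design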

Senior Solutions Architect, AI Factory Infrastructure

Nvidia

Remote (Durham, NC, US) · 38 days ago · $224,000–$356,500
Kubernetes Linux Python Terraform Helm AWS GCP Azure PostgreSQL CI/CD Docker Prometheus Grafana NVIDIA Omniverse Ethernet networking API SDKs Linux system environments Cloud native tooling

Senior Architect, AI Solutions Engineering

Nvidia

Santa Clara, CA, US · 59 days ago · $224,000–$356,500
Python Java Shell-script Kubernetes Docker SQL NoSQL MySQL Cassandra MongoDB Elasticsearch OpenStack Hadoop Git Puppet Chef JFrog Kafka CI/CD REST_APIs Large_Language_Models RAG Fine-Tuning_LLMs LangChain LangGraphs Cascading_models

Senior Solutions Architect, AI Infrastructure

Nvidia

Santa Clara, CA, US · 140 days ago · $184,000–$287,500
NVIDIA_GPU ARM_Development C Python Embedded_Linux_Systems NCCL DCGM UFM APIs OEM_Working_Experience Industrial_Computing Military_Computing Ruggedized_Computing CI/CD