Compiler Engineer - AI Inference

Nvidia

Quick summary

Work type
On-site
Location
Santa Clara, CA
Salary
$152,000–$241,500 / yr
Posted
53 days ago

Market check

Salary context

Competitive pay

How this pay compares to similar roles

Similar $193k
This role $197k
$138k most similar roles pay here $253k

This role pays more than 51% of similar roles. Most pay $160,000–$225,400 — the shaded band above. At the midpoint, this role pays about $197k versus about $193k for comparable roles.

Based on 240 similar postings.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads. Industry: Semiconductors & AI Computing

Nvidia currently has 980 open roles on FindRole.

Listed pay typically runs $168,000–$270,250 across 966 roles with salary data.

Most-posted roles

View all roles at Nvidia

At a glance

TL;DR · Compiler Engineer - AI Inference

NVIDIA is hiring senior-level AI Compiler Engineers to join their pioneering team, where you will play a crucial role in advancing AI performance by developing cutting-edge kernel generation and computational graph optimizations for next-generation NVIDIA GPUs. Your responsibilities include solving complex compilation challenges for both inference and training workloads, collaborating with experts across software, hardware, and research divisions on co-design initiatives, and scaling AI deployments to datacenter environments. Ideal candidates possess a BS or MS in Computer Science or related fields, along with 3+ years of industry experience in compiler optimizations and hands-on MLIR expertise. Strong skills in C/C++ and Python are essential, as is the ability to design comprehensive compiler frameworks from scratch, alongside deep knowledge of Large Language Model inference and its impact on computer architecture.

What you'll do

  • Drive technical innovation by developing kernel generation and computational graph optimizations for NVIDIA GPUs.
  • Solve complex compilation problems for AI workloads to enhance both inference and training performance.
  • Collaborate with hardware experts to co-design future silicon architectures for AI applications.
  • Advance datacenter-scale AI workload deployments through optimization and scalability improvements.
  • Design comprehensive compiler frameworks from the ground up, showcasing deep architecture understanding.

What we're looking for

  • 3+ years of industry experience in compiler optimizations, synthesis, and placement.
  • Strong hands-on experience with MLIR and Large Language Model (LLM) inference.
  • Exceptional skills in C/C++ and Python programming, debugging, performance analysis.
  • BS or MS in Computer Science/Engineering or equivalent; PhD preferred.
  • Experience implementing complex AI workloads on CPU/GPU/custom AI accelerators.
  • Proven ability to design comprehensive compiler frameworks from scratch.

More like this

Similar roles

Senior Compiler Engineer - AI

Nvidia

Remote (Austin, TX) +4 32 days ago $184,000$287,500
Python C/C++ LLVM MLIR Reinforcement_learning Genetic_algorithms Predictive_modeling LLMs CI/CD GPU_architecture Scalability Reliability Performance_engineering Open_source_contribution
Remote

Senior AI Compiler Engineer

Nvidia

Remote (Austin, TX) +1 35 days ago $184,000$287,500
Python C/C++ Julia Lisp LLVM GPU reinforcement_learning genetic_algorithms predictive_modeling complex_systems AI ML optimization_passes code_generation frontend_integration
Remote

Machine Learning Compiler Engineer

Qualcomm

New York, NY +2 15 days ago $200,800$301,200
MLIR LLVM Pytorch 2.0 TVM Triton SYCL Python C++ CUDA OpenCL Polyhedral Compiler Optimization Loop Transformation Vectorization GPU Programming High Performance Computing CI/CD Git Linux Docker

Senior Software Engineer - AI Inference

Nvidia

Remote (Santa Clara, CA) 63 days ago $152,000$241,500
Python C++ CUDA vLLM SGLang PyTorch Triton NCCL Dynamo CI/CD GPU InfiniBand Profiling Flamegraphs Microbenchmarks Concurrency Multi-threading Multi-process Kubernetes Docker PostgreSQL
Remote

Staff ML Compiler Engineer

General Motors (GM)

Remote (Sunnyvale, CA) +2 12 days ago $185,100$335,300
Python C++ MLIR ONNX TensorRT PyTorch TensorFlow JAX CUDA cuDNN cuBLAS CI/CD
Remote Hybrid