Compiler Engineer - AI Inference

Nvidia

Quick summary

Work type: On-site
Location: Santa Clara, CA
Salary: $152,000–$241,500 / yr
Posted: 54 days ago
Nearby: 99+ roles within 25 mi

Market check

Salary context

Competitive pay

How this pay compares to similar roles

Similar roles: $193k · This role: $197k · Chart scale: $139k–$252k

This role pays more than 52% of similar roles, most of which pay $160,000–$225,400. At the midpoint, this role pays about $197k versus about $193k for comparable roles.

Based on 240 similar postings.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads.

Industry: Semiconductors & AI Computing

Nvidia currently has 992 open roles on FindRole.

Listed pay typically runs $168,000–$264,500 across 979 roles with salary data.

At a glance

TL;DR · Compiler Engineer - AI Inference

NVIDIA is hiring senior-level AI Compiler Engineers to develop kernel generation and computational graph optimizations for next-generation NVIDIA GPUs. Responsibilities include solving complex compilation problems for both inference and training workloads, collaborating with software, hardware, and research teams on co-design, and scaling AI deployments to datacenter environments. Ideal candidates hold a BS or MS in Computer Science or a related field and have 3+ years of industry experience in compiler optimizations plus hands-on MLIR experience. Strong C/C++ and Python skills are essential, as are the ability to design comprehensive compiler frameworks from scratch and deep knowledge of Large Language Model inference and its impact on computer architecture.

Skills

C/C++, Python, MLIR, LLVM, NVIDIA GPUs, AI workloads, Datacenter optimization, CI/CD, Git, Linux, CUDA, TensorFlow, PyTorch, HPC, Docker, Kubernetes, AWS, GCP, Azure, PostgreSQL, MongoDB

What you'll do

Drive technical innovation by developing kernel generation and computational graph optimizations for NVIDIA GPUs.
Solve complex compilation problems for AI workloads to enhance both inference and training performance.
Collaborate with hardware experts to co-design future silicon architectures for AI applications.
Advance datacenter-scale AI workload deployments through optimization and scalability improvements.
Design comprehensive compiler frameworks from the ground up, showcasing deep architecture understanding.

What we're looking for

3+ years of industry experience in compiler optimizations, synthesis, and placement.
Strong hands-on experience with MLIR and Large Language Model (LLM) inference.
Exceptional skills in C/C++ and Python programming, debugging, and performance analysis.
BS or MS in Computer Science/Engineering or equivalent; PhD preferred.
Experience implementing complex AI workloads on CPU/GPU/custom AI accelerators.
Proven ability to design comprehensive compiler frameworks from scratch.

Similar roles

Senior Compiler Engineer, AI Inference Platforms

Nvidia

Remote (Santa Clara, CA) +1 · 114 days ago · $152,000–$241,500

MLIR, LLVM, XLA, Triton, PyTorch, JAX, Nsight Compute, C++, CUDA, Python

Senior Compiler Engineer - AI

Nvidia

Remote (Austin, TX) +4 · 33 days ago · $184,000–$287,500

Python, C/C++, LLVM, MLIR, Reinforcement learning, Genetic algorithms, Predictive modeling, LLMs, CI/CD, GPU architecture, Scalability, Reliability, Performance engineering, Open source contribution

Senior AI Compiler Engineer

Nvidia

Remote (Austin, TX) +1 · 36 days ago · $184,000–$287,500

Python, C/C++, Julia, Lisp, LLVM, GPU, reinforcement learning, genetic algorithms, predictive modeling, complex systems, AI, ML, optimization passes, code generation, frontend integration

Machine Learning Compiler Engineer

Qualcomm

New York, NY +2 · 16 days ago · $200,800–$301,200

MLIR, LLVM, PyTorch 2.0, TVM, Triton, SYCL, Python, C++, CUDA, OpenCL, Polyhedral Compiler Optimization, Loop Transformation, Vectorization, GPU Programming, High Performance Computing, CI/CD, Git, Linux, Docker

Senior Software Engineer - AI Inference

Nvidia

Remote (Santa Clara, CA) · 64 days ago · $152,000–$241,500

Python, C++, CUDA, vLLM, SGLang, PyTorch, Triton, NCCL, Dynamo, CI/CD, GPU, InfiniBand, Profiling, Flamegraphs, Microbenchmarks, Concurrency, Multi-threading, Multi-process, Kubernetes, Docker, PostgreSQL

Staff ML Compiler Engineer

General Motors (GM)

Remote (Sunnyvale, CA) +2 · 13 days ago · $185,100–$335,300

Python, C++, MLIR, ONNX, TensorRT, PyTorch, TensorFlow, JAX, CUDA, cuDNN, cuBLAS, CI/CD

Remote Hybrid