Senior Software Engineer - AI Inference
Nvidia
At a glance
AI generatedJoin NVIDIA as a senior software engineer on our cutting-edge AI inference team, where you will architect high-performance inference stacks for large-scale models, optimize GPU kernels, and drive industry benchmarks. Your daily tasks include contributing features to vLLM that leverage the latest NVIDIA hardware, developing optimized GPU kernels using techniques like fusion and autotuning, and building benchmarking methodologies. You’ll also design scheduling systems for containerized deployments on multi-GPU clusters across clouds. Ideal candidates have a strong background in computer science with extensive experience in Python and C/C++, along with knowledge of CUDA, Kubernetes, and ML frameworks such as PyTorch and vLLM. Experience with ML compilers like Triton and GPU libraries like CUTLASS is highly valued. This role offers the opportunity to work on groundbreaking AI technologies that push the boundaries of performance engineering and system optimization at scale.
Skills
What you'll do
What we're looking for
Market check
This $184,000–$287,500 range sits above 72% of similar postings on FindRole.
Peer median band
$170,700–$247,000
Median floor and ceiling across peers.
Typical midpoint (25–75%)
$168,250–$246,150
Middle half of comparable postings.
Based on 240 comparable postings.
* 240 is the maximum number of comparable postings sampled.
Employer
Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads. Industry: Semiconductors & AI Computing
Nvidia currently has 801 open roles on FindRole.
Listed pay typically runs $184,000–$287,500 across 797 roles with salary data.
Most-posted roles
More like this
Nvidia
Booz Allen Hamilton
Booz Allen Hamilton
Smartly
Plaid
Nvidia