Principal Software Engineer - AI Inference
Nvidia
At a glance
AI generated
NVIDIA is hiring a Senior Software Engineer – AI Inference to contribute to open-source LLM serving engines such as vLLM and SGLang and improve their performance on NVIDIA GPUs and systems. Day-to-day work includes writing features, optimizations, and tests for these engines; implementing efficient inference runtime capabilities; profiling hot paths across layers, from Python orchestration down to CUDA kernels; improving multi-GPU performance; and building regression tests. Ideal candidates have 5+ years of production software development experience with a focus on systems engineering, strong programming skills in Python, C++, and CUDA, experience with performance analysis techniques such as microbenchmarks and flame graphs, and familiarity with distributed systems concepts. Open-source contributions to projects such as vLLM, SGLang, or PyTorch are highly valued, as is a background in building benchmarking infrastructure for latency and throughput.
Market check
This $152,000–$241,500 range sits above 45% of similar postings on FindRole.
Peer median band
$168,000–$241,500
Median floor and ceiling across peers.
Typical midpoint (25–75%)
$162,000–$235,750
Middle half of comparable postings.
Based on 240 comparable postings.
* 240 is the maximum number of comparable postings sampled.
Employer
Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads.
Industry: Semiconductors & AI Computing
Nvidia currently has 801 open roles on FindRole.
Listed pay typically runs $184,000–$287,500 across 797 roles with salary data.