Senior Software Engineer - AI Inference
Nvidia
At a glance
AI generated
NVIDIA is seeking a Principal Software Engineer to lead the advancement of open-source LLM serving technologies like vLLM and SGLang, ensuring they excel on NVIDIA GPUs. This role involves hands-on development to enhance high-throughput, low-latency inference at scale by building features that improve efficiency and tail behavior, optimizing core hot paths, and improving multi-GPU and multi-node performance. The ideal candidate will have extensive experience in systems engineering, particularly with LLM inference/serving systems, and strong programming skills in Rust, C++, Python, and CUDA. They should also possess expertise in GPU performance analysis tools, distributed systems, and open-source contributions to projects like vLLM or SGLang. This position requires a deep understanding of the challenges in large-scale AI infrastructure and the ability to mentor senior engineers while raising the technical bar within NVIDIA.
Market check
This $272,000–$431,250 range sits above 99% of similar postings on FindRole.
Peer median band
$153,600–$241,500
Median floor and ceiling across peers.
Typical midpoint (25–75%)
$164,625–$235,750
Middle half of comparable postings.
Based on 240 comparable postings.
* 240 is the maximum number of comparable postings sampled.
Employer
Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads.
Industry: Semiconductors & AI Computing
Nvidia currently has 801 open roles on FindRole.
Listed pay typically runs $184,000–$287,500 across 797 roles with salary data.