Senior Software Engineer - AI Inference
Nvidia
Quick summary
Market check
How this pay compares to similar roles
This role pays more than 69% of similar roles. Most pay $145,487–$216,025 — the shaded band above. At the midpoint, this role pays about $197k versus about $181k for comparable roles.
Based on 240 similar postings.
Employer
Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads. Industry: Semiconductors & AI Computing
Nvidia currently has 855 open roles on FindRole.
Listed pay typically runs $184,000–$287,500 across 843 roles with salary data.
Most-posted roles
At a glance
NVIDIA is seeking a Senior Software Engineer to join its team focused on accelerating the deployment of efficient inference recipes for large language models (LLMs). The role involves translating recipe specifications into high-performance code within inference engines like vLLM, TRT-LLM, and SGLang, ensuring that quantized checkpoints serialize correctly for downstream serving. Key responsibilities include implementing quantized and sparse recipes, building benchmarking tools, developing data analysis tooling, and improving developer productivity through CI systems and training infrastructure. The ideal candidate is proficient in Python and C++, with a strong background in software engineering fundamentals, experience with ML accelerators, and familiarity with PyTorch internals or equivalent frameworks. Additionally, candidates should have a track record of contributing to large open-source projects and debugging numerical issues across mixed-precision boundaries.
Skills
What you'll do
What we're looking for
More like this
Nvidia
Nvidia
Nvidia
Nvidia
Nvidia
Nvidia