Senior Software Engineer - AI Inference
Nvidia
Quick summary
Market check
How this pay compares to similar roles
This role pays more than 72% of similar roles. Most pay $167,449–$245,112 — the shaded band above. At the midpoint, this role pays about $236k versus about $206k for comparable roles.
Based on 240 similar postings.
Employer
Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads. Industry: Semiconductors & AI Computing
Nvidia currently has 980 open roles on FindRole.
Listed pay typically runs $168,000–$270,250 across 966 roles with salary data.
Most-posted roles
At a glance
NVIDIA is seeking a Senior Inference Engineer to join the AIConfigurator team and enhance its system for discovering high-performance deployment configurations for large-scale LLM inference. The role involves building and evolving the core optimization engine, creating production-quality APIs and SDKs in Python and Rust, and developing backend-specific artifacts for various NVIDIA platforms. Engineers will collaborate with multiple teams to ensure simulated performance matches real-world deployments on GPUs like H100 and H200, while also improving model support through integration of profiling data and validation tools. Ideal candidates have extensive experience in GPU computing, distributed systems, and ML infrastructure, along with strong Python/Rust skills and a deep understanding of LLM inference concepts such as batching and parallelism strategies.
Skills
What you'll do
What we're looking for
More like this
Nvidia
Nvidia
Nvidia
The Hartford
Adobe
Capital One Financial