Senior DL Algorithms Engineer - Inference Performance
Nvidia
At a glance
AI generatedJoin NVIDIA as a Senior DL Algorithms Engineer and contribute to the optimization of Deep Learning workloads by implementing inference for language and multimodal models within NVIDIA Inference Microservices (NIMs). You will enhance TRT-LLM, an open-source inference serving library, through feature development and bug fixing while profiling bottlenecks across the entire stack to maximize performance. Collaborate with cross-functional teams to benchmark state-of-the-art DL model inferences and optimize the NVIDIA software/hardware stack for cutting-edge AI services. Ideal candidates hold a PhD or equivalent experience, possess deep expertise in deep learning inference, and are proficient in C++, PyTorch, and GPU programming (CUDA/OpenCL). Strong knowledge of computer architecture and modern LLM architectures is essential.
Skills
What you'll do
What we're looking for
Market check
This $184,000–$287,500 range sits above 74% of similar postings on FindRole.
Peer median band
$165,000–$245,600
Median floor and ceiling across peers.
Typical midpoint (25–75%)
$182,125–$238,250
Middle half of comparable postings.
Based on 239 comparable postings.
* 240 is the maximum number of comparable postings sampled.
Employer
Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads. Industry: Semiconductors & AI Computing
Nvidia currently has 802 open roles on FindRole.
Listed pay typically runs $184,000–$287,500 across 798 roles with salary data.
Most-posted roles
More like this
Nvidia
Nvidia
Nvidia
Nvidia
General Motors (GM)
Autodesk