Senior Deep Learning Software Engineer, TensorRT Performance
Nvidia
At a glance
AI generatedNVIDIA seeks an experienced Deep Learning Software Engineer to join its dynamic research and development team focused on optimizing the performance of deep learning inference solutions across various NVIDIA accelerators. This role involves establishing benchmarking methodologies, identifying performance issues, and contributing features to open-source frameworks like TensorRT and PyTorch. The engineer will develop optimized model pipelines for areas such as quantization and memory management, collaborating with diverse teams to enhance generative AI, automotive, robotics, and speech understanding applications. Essential skills include strong C++ and Python programming, experience with DL frameworks and inference libraries, performance optimization knowledge, and proficiency in GPU architecture and deep learning models like Transformers. Prior contributions to major LLM inference frameworks or graph compilers are highly valued, as is expertise in CUDA or related domain-specific languages.
Skills
What you'll do
What we're looking for
Market check
This $124,000–$195,500 range sits above 21% of similar postings on FindRole.
Peer median band
$154,637–$241,500
Median floor and ceiling across peers.
Typical midpoint (25–75%)
$166,100–$235,750
Middle half of comparable postings.
Based on 240 comparable postings.
* 240 is the maximum number of comparable postings sampled.
Employer
Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads. Industry: Semiconductors & AI Computing
Nvidia currently has 802 open roles on FindRole.
Listed pay typically runs $184,000–$287,500 across 798 roles with salary data.
Most-posted roles
More like this
Nvidia
Nvidia
Nvidia
Nvidia
Nvidia
Nvidia