Senior Performance Engineer - Deep Learning
Nvidia
At a glance
AI generatedJoin NVIDIA as a Deep Learning Engineer to enhance AI stacks by integrating advanced communication technologies like PyTorch, TRT-LLM, vLLM, SGLang, JAX, and more. You will work closely with the team behind NCCL, NVSHMEM, and GPUDirect to optimize multi-GPU communications for diverse demands from training on up to 100K GPUs to microsecond latency inference. Your daily tasks include analyzing AI workloads, improving compilers, designing fault-tolerant solutions, and authoring custom kernels. Ideal candidates have a B.S., M.S., or Ph.D. in Computer Science with extensive experience in HPC/AI, proficiency in Python, C++, CUDA, and familiarity with performance profiling tools like PyTorch profiler and NVIDIA Nsight Systems.
Skills
What you'll do
What we're looking for
Market check
This $152,000–$241,500 range sits above 40% of similar postings on FindRole.
Peer median band
$163,450–$257,300
Median floor and ceiling across peers.
Typical midpoint (25–75%)
$180,025–$246,150
Middle half of comparable postings.
Based on 240 comparable postings.
* 240 is the maximum number of comparable postings sampled.
Employer
Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads. Industry: Semiconductors & AI Computing
Nvidia currently has 801 open roles on FindRole.
Listed pay typically runs $184,000–$287,500 across 797 roles with salary data.
Most-posted roles
More like this
Nvidia
Nvidia
Nvidia
Nvidia
Nvidia
Nvidia