Senior HPC Performance Engineer - AI for Science at Scale
Nvidia
At a glance
AI generated
Join NVIDIA's Math Libraries team as a senior engineer focusing on kernel generation for AI and HPC applications, particularly matrix operations, JIT compilation, and fusions. You will scope, design, and implement high-quality dense numerical linear algebra software on GPUs, lead projects involving multiple engineers, mentor interns, and work with product management to understand feature requirements and contribute to technical roadmaps. Key responsibilities include improving library performance through re-architecting and reducing maintenance overhead.
Ideal candidates have a PhD, Master's, or Bachelor's degree in Computer Science or Applied Math, 8+ years of HPC software development experience in C++, strong fundamentals in kernel generation for linear algebra, and expertise in parallel programming with CUDA, MPI, OpenMP, and OpenACC. Experience with low-level assembly optimization and operator fusion is a plus, as is knowledge of GPU hardware architecture and agile project management practices.
Market check
This $184,000–$287,500 range sits above 75% of similar postings on FindRole.
Peer median band
$161,700–$247,600
Median floor and ceiling across peers.
Typical midpoint (25–75%)
$162,000–$235,750
Middle half of comparable postings.
Based on 240 comparable postings.
* 240 is the maximum number of comparable postings sampled.
Employer
Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads.
Industry: Semiconductors & AI Computing
Nvidia currently has 802 open roles on FindRole.
Listed pay typically runs $184,000–$287,500 across 798 roles with salary data.