Senior Math Libraries Engineer – AI and HPC

Nvidia

Actively hiring
Remote (US, CA, Santa Clara, US) Posted 50 days ago $184,000–$287,500 / year

At a glance

AI generated

TL;DR

Join NVIDIA's Math Libraries team as a senior engineer focusing on kernel generation for AI and HPC applications, particularly matrix operations, JIT compilation, and fusion. You will scope, design, and implement high-quality numerical dense linear algebra software on GPUs, lead projects involving multiple engineers, mentor interns, and collaborate with product management to understand feature requirements and contribute to technical roadmaps. Key responsibilities include re-architecting libraries to improve performance and reduce maintenance overhead. Ideal candidates have a PhD, Master's, or Bachelor's degree in Computer Science or Applied Math, 8+ years of HPC software development experience using C++, strong fundamentals in kernel generation for linear algebra, and expertise in parallel programming, preferably with CUDA, MPI, OpenMP, or OpenACC. Experience with low-level assembly optimization and operator fusion is a plus, as is knowledge of GPU hardware architecture and agile project management practices.

Skills

C++, CUDA, MPI, OpenMP, OpenACC, pthread, Python, JIT, Kernel generation, Linear algebra, Agile, JIRA, Assembly, GPU, HPC

What you'll do

  • Scope, design, and implement high-quality numerical dense linear algebra software on GPUs.
  • Lead execution of projects involving multiple engineers and coordinate with various teams.
  • Provide technical guidance to library engineers and mentor interns on complex projects.
  • Collaborate with product management to understand feature requirements and contribute to roadmaps.
  • Identify opportunities for re-architecting libraries to improve performance and reduce maintenance.

What we're looking for

  • 8+ years of experience in HPC software development with C++.
  • Strong background in kernel generation and composable library design for linear algebra.
  • Proven leadership in driving complex software development projects.
  • Expertise in parallel programming, preferably with CUDA, MPI, OpenMP, or OpenACC.
  • Knowledge of GPU hardware architecture and machine learning technologies.
  • Experience with low-level performance optimization using assembly and operator fusion.
  • Familiarity with agile software development practices and project management tools.

Market check

Salary context

This $184,000–$287,500 range sits above 75% of similar postings on FindRole.

Peer median band

$161,700–$247,600

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$162,000–$235,750

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads.

Industry: Semiconductors & AI Computing

Nvidia currently has 802 open roles on FindRole.

Listed pay typically runs $184,000–$287,500 across 798 roles with salary data.

More like this

Similar roles

Senior HPC Performance Engineer - AI for Science at Scale

Nvidia

US, CA, Santa Clara, US 100 days ago $184,000–$287,500
CUDA, Python, C++, PyTorch, JAX, Warp, HPC, Distributed Learning, Atomistic Modeling, CI/CD, Git, Linux, NVIDIA DGX Systems, GPU Programming, Parallel Computing, Data Structures, Algorithm Design, Machine Learning Frameworks, Scientific AI Codebases, Computational Chemistry, Digital Biology

Senior Math Libraries Engineer - Sparsity in AI

Nvidia

Remote (US, CA, Santa Clara, US) 48 days ago $184,000–$287,500
C++, CUDA, Python, PyTorch, MLIR, TACO, MPI, OpenMP, CI/CD, JIRA, GPU, Sparse Computations, LLMs, Deep Learning, Numerical Linear Algebra, HPC

Senior AI and HPC Observability Engineer

Nvidia

US, CA, Santa Clara, US 87 days ago $152,000–$241,500
Python, Go, Java, Kubernetes, OpenTelemetry, Prometheus, Kafka, Spark, Flink, PromQL, Docker, CI/CD, Git, Linux, AWS, GCP, Azure

Senior ML Engineer, ML compute

General Motors (GM)

Remote (GM Automation - Sunnyvale, US) 98 days ago $155,420–$395,900
Python, Kubernetes, Go, C++, GCP, Azure, AWS, PyTorch, TorchX, Ray, Docker, CI/CD

Senior AI/ML Capacity Engineer

General Motors (GM)

Remote (GM Automation - Sunnyvale, US) 62 days ago $144,700–$261,300
Python, Pandas, NumPy, BigQuery, Looker, Git, Linux, GCP, CI/CD, Docker, Kubernetes, Prometheus, SQL, Forecasting, BI platforms, Modern ML system architecture, Capacity planning, Data modeling