Senior Deep Learning Performance Architect - LPU

Nvidia

Remote Actively hiring
CA · TX Posted 16 days ago $152,000$241,500 / year

At a glance

AI generated

TL;DR

NVIDIA is seeking a Senior Deep Learning Performance Architect to join its pioneering team focused on advancing AI Inference performance through innovative hardware-software co-design. This role involves designing cutting-edge GPU and system architectures, constructing and testing deep learning algorithms, and building efficient power and performance models to guide future hardware architecture decisions. The ideal candidate will collaborate across the company with software, research, and product teams to drive AI innovation, requiring a strong background in machine learning, deep learning, and computer architecture. Expert programming skills in C, C++, and Python, along with experience in GPU computing (CUDA) and HPC technologies, are essential, as is familiarity with systems-level performance modeling and analysis for improving AI Inference workloads.

Skills

Python C C++ CUDA MPI OpenMP HPC GPU Deep Learning Machine Learning Performance Modeling Systems Performance Analysis AI Inference Workloads CUDA Kernels Custom ASIC Hardware

What you'll do

  • Design innovative GPU and system architectures to enhance AI inference performance.
  • Construct and test deep learning algorithms and applications to improve efficiency.
  • Analyze the impact of hardware-software interactions on future AI systems.
  • Develop power and performance models for AI inference stacks to guide new HW.
  • Lead cross-functional teams in guiding the direction of AI innovation at NVIDIA.

What we're looking for

  • MS or PhD in CS, EE, Math or 5+ years of relevant experience
  • Expertise in C, C++, Python, GPU computing (CUDA), and HPC technologies
  • Strong mathematical foundation in machine learning and deep learning
  • Experience with systems-level performance modeling and analysis
  • Background in improving AI Inference workloads through CUDA kernel development

Market check

Salary context

Below market

How this pay compares to similar roles

Similar $221k
This role $197k
$139k most similar roles pay here $276k

This role pays less than 67% of similar roles. Most pay $196,750–$246,150 — the shaded band above. At the midpoint, this role pays about $197k versus about $221k for comparable roles.

Based on 240 similar postings.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads. Industry: Semiconductors & AI Computing

Nvidia currently has 824 open roles on FindRole.

Listed pay typically runs $184,000–$287,500 across 812 roles with salary data.

Most-posted roles

View all roles at Nvidia

More like this

Similar roles

Senior Deep Learning Performance Architect

Nvidia

Santa Clara, CA 29 days ago $184,000$287,500
Python C++ GPU ASIC Deep Learning LLM Batching KV-cache Latency/Tuning Multi-node Scaling Memory Hierarchy Scalability System Architecture Performance Tuning Profiling Debugging

Senior Deep Learning Performance Architect

Nvidia

Santa Clara, CA 145 days ago $184,000$287,500
Python C C++ Pytorch JAX TensorRT CUDNN CUBLAS CUTLASS MLIR Triton CUDA OpenCL GPU Deep Learning ASIC Performance Modeling Architecture Simulation Profiling Analysis

Senior Deep Learning Performance Architect

Nvidia

Santa Clara, CA 145 days ago $184,000$287,500
Python C++ GPU Deep_Learning ASIC Transformer_Models Computer_Architecture Interconnect_Fabrics Parallel_Computing AI_Algorithms

Senior Deep Learning Computer Architect

Nvidia

Santa Clara, CA 145 days ago $184,000$287,500
C++ Python CUDA PyTorch GPU ComputerArchitecture DeepLearningKernels LLMWorkloads PerformanceAnalysis ParallelizationStrategies FusionStrategies
Hybrid