Senior Deep Learning Systems Engineer, Datacenters

Nvidia

Hybrid Actively hiring
Santa Clara, US · Redmond, US Posted 22 days ago $184,000$287,500 / year

At a glance

AI generated

TL;DR

As a Deep Learning Systems Engineer at NVIDIA’s datacenter team, you will play a pivotal role in analyzing the performance and power consumption of deep learning applications on cutting-edge hardware. Your day-to-day responsibilities include developing software infrastructure to characterize DL applications, evolving cost-efficient datacenter architectures for Large Language Models (LLMs), and working with experts to create analysis tools using Python, bash, and C++. You will also analyze system characteristics and develop methodologies to measure performance metrics and identify efficiency improvements. The ideal candidate has a Bachelor’s degree in Electrical Engineering or Computer Science, preferably with an advanced degree, and at least 8 years of relevant experience in system software, silicon architecture, or performance modeling. Proficiency in C/C++ and Python is essential, along with exposure to containerization platforms like Docker and workload managers such as Slurm.

Skills

Python C/C++ CUDA PyTorch TensorFlow Linux Docker Slurm perf gprof nvidia-smi dcgm

What you'll do

  • Develop software infrastructure to analyze deep learning applications.
  • Evolve cost-efficient data center architectures for Large Language Models.
  • Create analysis tools in Python, bash, and C++ to measure DL performance metrics.
  • Analyze system characteristics of deep learning applications on Nvidia hardware.
  • Estimate efficiency improvements through performance metric measurements.

What we're looking for

  • Bachelor’s degree in Electrical Engineering or Computer Science, or equivalent experience.
  • 8+ years of relevant industry experience.
  • Deep understanding of computer system architecture and performance analysis.
  • Experience programming in C/C++ and Python; familiarity with containerization platforms (Docker) and workload managers (Slurm).
  • Expertise in at least one area: system software, silicon architecture, or deep learning frameworks.
  • Demonstrated ability to work independently and manage tasks from start to finish.

Market check

Salary context

This $184,000–$287,500 range sits above 76% of similar postings on FindRole.

Peer median band

$167,000$257,550

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$171,200$235,750

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads. Industry: Semiconductors & AI Computing

Nvidia currently has 801 open roles on FindRole.

Listed pay typically runs $184,000–$287,500 across 797 roles with salary data.

Most-posted roles

View all roles at Nvidia

More like this

Similar roles

Senior Deep Learning Computer Architect

Nvidia

Us, Ca, Santa Clara, US 140 days ago $184,000$287,500
C++ Python CUDA PyTorch GPU ComputerArchitecture DeepLearningKernels LLMWorkloads PerformanceAnalysis ParallelizationStrategies FusionStrategies

Senior Deep Learning Software Engineer, Inference

Nvidia

Remote (Us, Ca, Santa Clara, US) 24 days ago $184,000$287,500
C++ Python CUDA NCCL NVSHMEM OAI_TRITON CUTLASS PyTorch vLLM SGLang FlashInfer Multi-GPU_Communications Deep_Learning_Frameworks Performance_Optimization GPU_Acceleration
Remote

Senior Deep Learning Software Engineer

Nvidia

US 85 days ago $224,000$356,500
Python PyTorch JAX CUDA TensorRT NVIDIA_TensorRT_LLM GPU_optimization CUTLASS Triton Deep_learning_frameworks Performance_analysis GPU_architecture High_performance_computing Model_inference Inference_optimization

#Senior Embedded Software Engineer, Cloud Edge and Data Center Machine Learning

Qualcomm

San Diego, Ca,Us, US 88 days ago $111,300$166,900
C GNU/LLVM BSP RTOS TrustZone Embedded Linux git Gerrit PyTorch JAX Llama.cpp PCIe LPDDR USB UFS ECC PCI AER I2C SPI SPMI AVSBus PMBus QuRT Glink QDSS DVFS DCVS compilers profilers source control systems emulators JTAG serial debuggers logic analyzers

#Senior Embedded Software Engineer, Cloud Edge and Data Center Machine Learning

Qualcomm

San Diego, Ca,Us, US 11 days ago $111,300$166,900
C GNU/LLVM BSP RTOS TrustZone Embedded Linux git Gerrit PyTorch JAX Llama.cpp PCIe LPDDR USB UFS ECC PCI AER I2C SPI SPMI AVSBus PMBus QuRT Glink QDSS DVFS DCVS compilers profilers source control systems emulators JTAG serial debuggers logic analyzers