Senior DGX Cloud AI Infrastructure Software Engineer
Nvidia
At a glance
AI generatedAs a senior AI infrastructure software engineer on NVIDIA's DGX Cloud Lepton Team, you will design, build, and maintain large-scale AI platforms that enable efficient training, inferencing, and fine-tuning of models. Your day-to-day responsibilities include developing tools to optimize AI/ML workload efficiency, analyzing failures from application to hardware levels, enhancing infrastructure for reliability, and co-designing APIs with NVIDIA's resiliency stacks. You will also define metrics to track system reliability and collaborate in a culture that values learning and iterative improvement. The role requires expertise in Python, C/C++, Kubernetes, observability platforms like ELK and Prometheus, and experience with AI frameworks such as PyTorch and TensorFlow. Additionally, knowledge of NVIDIA GPUs, RDMA networks, and cloud-native infrastructure is essential for this high-impact position at the forefront of AI innovation.
Skills
What you'll do
What we're looking for
Market check
This $184,000–$287,500 range sits above 73% of similar postings on FindRole.
Peer median band
$168,000–$258,750
Median floor and ceiling across peers.
Typical midpoint (25–75%)
$165,852–$246,150
Middle half of comparable postings.
Based on 240 comparable postings.
* 240 is the maximum number of comparable postings sampled.
Employer
Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads. Industry: Semiconductors & AI Computing
Nvidia currently has 801 open roles on FindRole.
Listed pay typically runs $184,000–$287,500 across 797 roles with salary data.
Most-posted roles
More like this
Nvidia
Nvidia
Allstate
Nvidia
Adobe
Nvidia