Senior DL Software Engineer, Model Optimization and Edge Deployment - Autonomous Vehicles

Nvidia

Quick summary

Work type
On-site
Location
Santa Clara, CA
Salary
$184,000–$287,500 / yr
Posted
2 days ago

Market check

Salary context

Above market

How this pay compares to similar roles

Similar $220k
This role $236k
$163k most similar roles pay here $301k

This role pays more than 73% of similar roles. Most pay $193,000–$246,150 — the shaded band above. At the midpoint, this role pays about $236k versus about $220k for comparable roles.

Based on 240 similar postings.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads. Industry: Semiconductors & AI Computing

Nvidia currently has 985 open roles on FindRole.

Listed pay typically runs $184,000–$287,500 across 971 roles with salary data.

Most-posted roles

View all roles at Nvidia

At a glance

TL;DR · Senior DL Software Engineer, Model Optimization and Edge Deployment - Autonomous Vehicles

As a Deep Learning Engineer at NVIDIA, you will join the cutting-edge team driving the AI revolution in Embodied AI for autonomous vehicles (AVs). Your role involves designing and implementing state-of-the-art model optimization techniques such as speculative decoding and KV cache streaming to enhance real-time performance. You’ll also work on advanced compression methods like quantization and pruning to reduce model footprints while maintaining safety-critical accuracy, all within the PyTorch ecosystem. Additionally, you will collaborate with research teams to translate innovations into practical solutions for TensorRT conversion and deployment across diverse NVIDIA edge architectures. Essential skills include expert-level proficiency in PyTorch or similar frameworks, deep familiarity with TensorRT and CUDA, and experience with low-bit inference and custom high-performance kernels using CUDA or Triton. This role demands a thorough understanding of GPU architecture and the unique constraints of real-time robotics, including safety-critical determinism and ultra-low latency requirements.

What you'll do

  • Develop state-of-the-art model optimization techniques for real-time robotic execution.
  • Implement advanced compression methods like quantization and pruning to reduce model size without sacrificing accuracy.
  • Design high-performance inference strategies including automated sharding and efficient attention kernels.
  • Conduct detailed layer-by-layer profiling to identify and resolve compute and memory bottlenecks.
  • Automate deployment pipelines using PyTorch ecosystem tools for TensorRT conversion.

What we're looking for

  • Expert-level proficiency in PyTorch or similar ML frameworks.
  • Proven experience training, deploying, and optimizing large-scale DL models.
  • Deep familiarity with NVIDIA’s TensorRT and CUDA SDKs.
  • Experience implementing low-bit inference techniques for efficiency.
  • Strong background in GPU architecture and high-performance kernel development.
  • Active contributions to open-source libraries like vLLM and TensorRT-LLM.
  • Understanding of real-time robotics constraints, including safety-critical determinism.

More like this

Similar roles

Senior Research Engineer - Autonomous Vehicles

Nvidia

Santa Clara, CA 149 days ago $184,000$287,500
PyTorch TensorFlow JAX Python C++ CUDA Kubernetes SLURM Reinforcement_Learning PPO SAC Q-learning GPU_Cluster HPC Distributed_Training_Systems Multimodal_Datasets Simulation_Infrastructure LLMs Policy_Learning Curriculum_Learning Domain_Randomization Reward_Shaping

Senior Perception Engineer - Autonomous Vehicles

Nvidia

Santa Clara, CA 32 days ago $184,000$287,500
PyTorch Python C++ CUDA DeepLearning MultiSensorFusion 3DComputerVision CameraCalibration SensorFusion DataVerification LossFunctionEngineering MLOps EmbeddedSystems RealTimeApplications KPIBuilding LargeScaleBenchmarking DataCollectionPrioritization LabelingPrioritization

Senior Software Systems Engineer - Autonomous Vehicles

Nvidia

Remote (Santa Clara, CA) 32 days ago $152,000$241,500
Python Magic_Cyber_Systems_Engineer Cameo Model-Based_Systems_Engineering V-Model ISO_26262 Robotics_Development Sensing Perception Motion_Control Systems_Integration Test_Strategy_Development Data_Analysis AI_Tooling
Remote

Senior Software Engineer - Autonomous Vehicles

Nvidia

Santa Clara, CA 20 days ago $224,000$356,500
C++ Python ROS CUDA TensorFlow PyTorch Docker Kubernetes AWS CI/CD PostgreSQL SQLite Prometheus Grafana Git Linux Autonomous Vehicle Planning Motion Planning Rapid Prototyping Real-Time Systems