Senior DL Software Engineer, Model Optimization and Edge Deployment - Autonomous Vehicles

Nvidia

Quick summary

Work type: On-site
Location: Santa Clara, CA
Salary: $184,000–$287,500 / yr
Posted: 2 days ago
Nearby: 99+ roles within 25 mi

Market check

Salary context

Above market

How this pay compares to similar roles

Similar $220k

This role $236k

$163k most similar roles pay here $301k

This role pays more than 73% of similar roles. Most pay $193,000–$246,150 — the shaded band above. At the midpoint, this role pays about $236k versus about $220k for comparable roles.

Based on 240 similar postings.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads. Industry: Semiconductors & AI Computing

Nvidia currently has 985 open roles on FindRole.

Listed pay typically runs $184,000–$287,500 across 971 roles with salary data.

Most-posted roles

View all roles at Nvidia

At a glance

TL;DR · Senior DL Software Engineer, Model Optimization and Edge Deployment - Autonomous Vehicles

Apply Now Log in to save

As a Deep Learning Engineer at NVIDIA, you will join the cutting-edge team driving the AI revolution in Embodied AI for autonomous vehicles (AVs). Your role involves designing and implementing state-of-the-art model optimization techniques such as speculative decoding and KV cache streaming to enhance real-time performance. You’ll also work on advanced compression methods like quantization and pruning to reduce model footprints while maintaining safety-critical accuracy, all within the PyTorch ecosystem. Additionally, you will collaborate with research teams to translate innovations into practical solutions for TensorRT conversion and deployment across diverse NVIDIA edge architectures. Essential skills include expert-level proficiency in PyTorch or similar frameworks, deep familiarity with TensorRT and CUDA, and experience with low-bit inference and custom high-performance kernels using CUDA or Triton. This role demands a thorough understanding of GPU architecture and the unique constraints of real-time robotics, including safety-critical determinism and ultra-low latency requirements.

Skills

PyTorch TensorRT CUDA vLLM TensorRT-LLM SGLang JAX NVIDIA_TensorRT CUTLASS Trition GPU_architecture low-bit_inference custom_high_performance_kernels model_optimization deep_learning_SDKs hardware_in_the_loop_testing layer_by_layer_model_profiling

What you'll do

Develop state-of-the-art model optimization techniques for real-time robotic execution.
Implement advanced compression methods like quantization and pruning to reduce model size without sacrificing accuracy.
Design high-performance inference strategies including automated sharding and efficient attention kernels.
Conduct detailed layer-by-layer profiling to identify and resolve compute and memory bottlenecks.
Automate deployment pipelines using PyTorch ecosystem tools for TensorRT conversion.

What we're looking for

Expert-level proficiency in PyTorch or similar ML frameworks.
Proven experience training, deploying, and optimizing large-scale DL models.
Deep familiarity with NVIDIA’s TensorRT and CUDA SDKs.
Experience implementing low-bit inference techniques for efficiency.
Strong background in GPU architecture and high-performance kernel development.
Active contributions to open-source libraries like vLLM and TensorRT-LLM.
Understanding of real-time robotics constraints, including safety-critical determinism.

Similar roles

Senior Research Engineer - Autonomous Vehicles

Nvidia

Santa Clara, CA 149 days ago $184,000–$287,500

PyTorch TensorFlow JAX Python C++ CUDA Kubernetes SLURM Reinforcement_Learning PPO SAC Q-learning GPU_Cluster HPC Distributed_Training_Systems Multimodal_Datasets Simulation_Infrastructure LLMs Policy_Learning Curriculum_Learning Domain_Randomization Reward_Shaping

Save

Senior Perception Engineer - Autonomous Vehicles

Nvidia

Santa Clara, CA 32 days ago $184,000–$287,500

PyTorch Python C++ CUDA DeepLearning MultiSensorFusion 3DComputerVision CameraCalibration SensorFusion DataVerification LossFunctionEngineering MLOps EmbeddedSystems RealTimeApplications KPIBuilding LargeScaleBenchmarking DataCollectionPrioritization LabelingPrioritization

Save

Senior Deep Learning Engineering - Autonomous Vehicles

Nvidia

Santa Clara, CA 24 days ago $224,000–$356,500

Python PyTorch LLMs VLMs vLLM SGLang SFT DPO GRPO Transformer architectures Deep learning algorithms

Save

Senior AI Architect, Foundation Models and SoC Co-Design – Autonomous Vehicles

Nvidia

Santa Clara, CA 6 days ago $208,000–$327,750

NVIDIA CUDA TensorRT Triton DRIVE Jetson Python C++ Docker Kubernetes AWS Git GitHub CI/CD PostgreSQL Redis Prometheus Grafana Multimodal_foundation_models Vision-Language-Action_(VLA)_models Mixture-of-Experts_(MoE) Transformer Diffusion_model

Save

Senior Software Systems Engineer - Autonomous Vehicles

Nvidia

Remote (Santa Clara, CA) 32 days ago $152,000–$241,500

Python Magic_Cyber_Systems_Engineer Cameo Model-Based_Systems_Engineering V-Model ISO_26262 Robotics_Development Sensing Perception Motion_Control Systems_Integration Test_Strategy_Development Data_Analysis AI_Tooling

Remote

Save

Senior Software Engineer - Autonomous Vehicles

Nvidia

Santa Clara, CA 20 days ago $224,000–$356,500

C++ Python ROS CUDA TensorFlow PyTorch Docker Kubernetes AWS CI/CD PostgreSQL SQLite Prometheus Grafana Git Linux Autonomous Vehicle Planning Motion Planning Rapid Prototyping Real-Time Systems

Save