Senior Software Engineer - VLM Microservices for Neural Reconstruction

Nvidia

Actively hiring

Santa Clara, CA · Redmond, WA

Posted 36 days ago

$152,000–$241,500 / year

At a glance

AI generated

TL;DR

As a Senior Software Engineer on NVIDIA’s Omniverse NuRec SDK team, you will help bring 3D Vision Language Models (VLMs) into production. Day to day, you will design and optimize containerized inference execution for state-of-the-art models, develop benchmarks that validate accuracy and performance, and maintain these systems throughout their lifecycle. You will also contribute VLM-related features to open-source projects such as vLLM while collaborating closely with research and product teams. Ideal candidates have a strong background in building distributed AI systems using Docker, Kubernetes, and modern inference platforms such as vLLM, Torch, and TRT-LLM, along with proficiency in Python and C++. Experience in 3D graphics, neural reconstruction, and low-level ML model optimization is highly valued.

Skills

Python, C++, Kubernetes, Docker, vLLM, Torch, TRT, TRT-LLM, Helm, REST, gRPC, CI/CD, CUDA, PostgreSQL, Git, 3D computer vision, Neural reconstruction, Gaussian Splatting

What you'll do

Design and optimize containerized inference execution for NVIDIA's latest 3D VLMs.
Develop benchmarks to validate models' accuracy and performance metrics.
Release and maintain production-grade AI systems throughout their lifecycle.
Contribute VLM-related features to open-source projects like vLLM.
Collaborate with Research and Product teams to influence common roadmaps.

What we're looking for

Master’s in Computer Science/Electrical Engineering with relevant experience.
Expertise in building, validating, and releasing production-grade AI distributed systems.
Deep knowledge of Docker, Kubernetes, REST, gRPC, and Helm for distributed applications.
Hands-on experience with modern inference platforms such as vLLM, Torch, and TRT.
Proficiency in Python and C++ with strong software engineering fundamentals.
Track record of contributing to open-source projects or production-grade software.
Experience in ML model engineering, including training, fine-tuning, and distillation.

Market check

Salary context

Competitive pay

How this pay compares to similar roles

[Pay comparison chart: scale from $141k to $252k; similar roles at $205k, this role at $197k]

This role pays less than 52% of similar roles. Most similar roles pay $174,050–$235,750. At the midpoint, this role pays about $197k versus about $205k for comparable roles.

Based on 240 similar postings.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads.

Industry: Semiconductors & AI Computing

Nvidia currently has 824 open roles on FindRole.

Listed pay typically runs $184,000–$287,500 across 812 roles with salary data.

Similar roles

Senior System Software Engineer - Neural Graphics SDKs

Nvidia

Santa Clara, CA 36 days ago $184,000–$287,500

Python, C++, Kubernetes, CI/CD, CUDA, Slang, GLSL, HLSL, Metal, Gaussian Splatting, Neural Reconstruction, NVIDIA Omniverse, GSplat, Docker, Git, PostgreSQL

Senior Systems Software Engineer - Deep Learning Solutions

Nvidia

Santa Clara, CA 84 days ago $224,000–$356,500

CUDA, TensorRT, TVM, MLIR, XLA, Python, C/C++, QNX, Linux, GPU, Deep Learning, Transformer Models, Vision-Language Models, Diffusion Models, State Space Models, Parallel Programming, Memory Management, Embedded Systems, CI/CD, MLPerf

Senior Deep Learning Software Engineer, Inference

Nvidia

Remote (Santa Clara, CA) 29 days ago $184,000–$287,500

C++, Python, CUDA, NCCL, NVSHMEM, OAI Triton, CUTLASS, PyTorch, vLLM, SGLang, FlashInfer, Multi-GPU Communications, Deep Learning Frameworks, Performance Optimization, GPU Acceleration

Remote

Senior Deep Learning Systems Engineer, Datacenters

Nvidia

Santa Clara, CA 27 days ago $184,000–$287,500

Python, C/C++, CUDA, PyTorch, TensorFlow, Linux, Docker, Slurm, perf, gprof, nvidia-smi, dcgm

Hybrid

Senior Infrastructure Software Engineer, Deep Learning Libraries

Nvidia

Santa Clara, CA 41 days ago $152,000–$241,500

Python, C++, Kubernetes, Jenkins, GitHub Actions, GitLab, Docker, CMake, HTML5, CSS, NodeJS, React, Git, Make, Bazel, Groovy, CI/CD, JUnit, PyTest, Ubuntu, RedHat, Windows, QNX

Senior Deep Learning Software Engineer

Nvidia

Santa Clara, CA 40 days ago $224,000–$356,500

Python, PyTorch, JAX, CUDA, TensorRT, NVIDIA TensorRT LLM, GPU optimization, CUTLASS, Triton, Deep learning frameworks, Performance analysis, GPU architecture, High performance computing, Model inference, Inference optimization

Hybrid
