Browse tech roles

Basic role filtering by workplace, salary floor, and post age. For full AI matching and advanced filtering upload your resume using AI Match.

6 of up to 20 (filtered)

Senior Inference Engineer, AIConfigurator for Dynamo

Nvidia

Remote (Santa Clara, CA) 5 days ago $184,000$287,500
Actively hiring Posted this week Verified listing Above market
Python Rust Kubernetes TensorRT-LLM vLLM SGLang Triton Inference Server Dynamo CI/CD GPU computing Distributed systems ML infrastructure High-performance model serving Data-driven performance analysis Benchmarking Optimization NVIDIA GPUs Disaggregated serving Prefill/decode separation KV cache management NCCL NIXL NVSHMEM Expert-parallel MoE inference
Remote

Senior DL Algorithms Engineer - Inference Performance

Nvidia

Remote (Santa Clara, CA) 43 days ago $152,000$241,500
Actively hiring Competitive pay
PyTorch NVIDIA_TRT-LLM vLLM SGLang FlashInfer GPU_architecture CUDA OpenCL Deep_Learning Neural_Networks Performance_profiling HPC Computer_Architecture Python C++
Remote

Senior Software Engineer - AI Inference

Nvidia

Remote (Santa Clara, CA) 64 days ago $152,000$241,500
Actively hiring Competitive pay
Python C++ CUDA vLLM SGLang PyTorch Triton NCCL Dynamo CI/CD GPU InfiniBand Profiling Flamegraphs Microbenchmarks Concurrency Multi-threading Multi-process Kubernetes Docker PostgreSQL
Remote