Browse tech roles

Filter the feed by workplace, employment type, salary floor, and post age. For ranked matching against your resume, use AI Match.

6 of up to 20 (filtered)

Software Engineer, TensorRT Specialized Platforms - New College Grad 2025

Nvidia

Santa Clara, CA 6 days ago $124,000$195,500
Actively hiring Posted this week Verified listing Competitive pay
C++ CUDA Python Modern C++ standards C++ Standard Template Library (STL) Deep learning models Performance optimization Systems programming Embedded systems Compiler concepts Software performance analysis Profiling techniques Computer architecture Memory management Parallel computing concepts

Senior AI-Native Systems Software Engineer, TensorRT

Nvidia

Santa Clara, CA 44 days ago $152,000$241,500
Actively hiring Competitive pay
C++ CUDA Python LLMs Diffusion Multi-modal models Agentic framework experience Performance profiling CUDA programming High-velocity prototyping Clean code Maintainable code End-to-end product sense Collaborative mindset Systems thinking
Hybrid

Deep Learning Software Engineer, TensorRT Performance - New College Grad 2026

Nvidia

Remote (Santa Clara, CA) 62 days ago $124,000$195,500
Actively hiring Below market
C++ Python TensorRT PyTorch CUDA ONNX JAX TensorFlow performance analysis GPU architecture Transformers Recommenders ASR TTS Visual Understanding graph compilers Jetson systems deep learning inference low-latency systems resource-constrained systems
Remote

Senior Software Engineer, Deep Learning Inference - TensorRT

Nvidia

Santa Clara, CA 79 days ago $152,000$241,500
Actively hiring Competitive pay
C++ Python CUDA TensorRT PyTorch TensorFlow ONNX Runtime NVIDIA GPUs Machine Learning Performance Benchmarking Profiling Optimizations Compiler Development Graph Parsers Optimizers
Hybrid

Senior Software Engineer – TensorRT Edge-LLM

Nvidia

Remote (Santa Clara, CA) 79 days ago $152,000$241,500
Actively hiring Above market
C++ TensorRT CUDA vLLM SGLang MLC-LLM FlashInfer Transformer models Quantization Tensor parallelism Memory-efficient scheduling Speculative decoding KV cache management Compiler infrastructure Robotics Embedded AI pipelines Performance profiling GPU architecture
Remote