Browse tech roles

Filter the feed by workplace, employment type, salary floor, and post age. For ranked matching against your resume, use AI Match.


Senior/Staff Deep Reinforcement Learning Engineer

DoorDash, Inc

San Francisco, CA 1 day ago $168,000–$247,000
Actively hiring Posted today Verified listing Competitive pay
JAX, Python, GPU-accelerated simulation, Distributed training infrastructure, Deep reinforcement learning, Model-based RL, Policy gradients, Value functions, Reward shaping, Sim-to-real transfer, Data-driven mindset, Experiment pipelines, CI/CD

Deep Learning Computer Architect - New College Grad 2026

Nvidia

Santa Clara, CA 3 days ago $124,000–$195,500
Actively hiring Posted this week Verified listing Below market
C++ Python CUDA PyTorch GPU ComputerArchitecture PerformanceAnalysis DeepLearningKernels MLOps LLMWorkloads ParallelizationStrategies FusionStrategies
Hybrid

Senior Software Architect - Deep Learning and HPC Communications

Nvidia

Remote (Santa Clara, CA) 7 days ago $184,000–$287,500
Actively hiring Posted this week Verified listing Above market
C/C++ MPI NCCL NVSHMEM UCX CUDA Linux InfiniBand RoCE NVLink PyTorch TensorFlow HPC Networking Simulation Quantitative_Modeling SHMEM Parallel_Programming Deep_Learning_Pods
Remote

Senior Deep Learning Framework Communications Engineer

Nvidia

Remote (Santa Clara, CA) 7 days ago $152,000–$241,500
Actively hiring Posted this week Verified listing Competitive pay
PyTorch C++ CUDA Python NCCL NVSHMEM JAX TRT-LLM vLLM SGLang HPC AI MPI TensorRT NVIDIA_Nsight_Systems Performance_Profiling Parallel_Programming Compiler_Technologies Memory_Hierarchy Tensor_Layout Distributed_Inference Mixture_of_Experts Reinforcement_Learning
Remote

Distinguished Software Architect - Deep Learning and HPC Communications

Nvidia

Santa Clara, CA 13 days ago $320,000–$488,750
Actively hiring Verified listing Above market
HPC MPI NCCL NVSHMEM UCX CUDA InfiniBand Ethernet C C++ PyTorch TensorFlow GPU Networking System_Architecture Parallel_Programming_Models ML_DL_Fundamentals Performance_Optimization Fault_Tolerance Competitive_Assessments

Senior Deep Learning Communication Architect

Nvidia

Santa Clara, CA 15 days ago $184,000–$287,500
Actively hiring Above market
PyTorch, TensorRT-LLM, vLLM, SGLang, C++, Python, CUDA, OpenCL, InfiniBand, RoCE, MPI, NCCL, UCX, UCC, NVSHMEM, Data Parallelism, Pipeline Parallelism, Tensor Parallelism, Expert Parallelism, FSDP, Disaggregated Serving, Dynamo, Triton

Senior Deep Learning Performance Architect - LPU

Nvidia

Remote (CA) 17 days ago $152,000–$241,500
Actively hiring Competitive pay
Python C C++ CUDA MPI OpenMP HPC GPU Deep Learning Machine Learning Performance Modeling Systems Performance Analysis AI Inference Workloads CUDA Kernels Custom ASIC Hardware
Remote

Senior Deep Learning Frameworks CUDA Software Engineer

Nvidia

Remote (Santa Clara, CA) 21 days ago $184,000–$287,500
Actively hiring Above market
CUDA PyTorch JAX TRT-LLM vLLM SGLang Python C++ NCCL MPI UCX Docker CI/CD Prometheus Grafana Git GitHub Linux NVIDIA_Nsight_Systems
Remote

Senior Software Engineer, CUDA Deep Learning Systems

Nvidia

Remote (Santa Clara, CA) 21 days ago $184,000–$287,500
Actively hiring Above market
CUDA Python C++ PyTorch JAX TensorRT vLLM NeMo Megatron MaxText Triton XLA NCCL MPI UCX Docker CI/CD Git GitHub Linux PostgreSQL Prometheus Grafana
Remote

Senior Deep Learning Tools Engineer – CUDA Tile

Nvidia

Remote (Santa Clara, CA) 29 days ago $152,000–$241,500
Actively hiring Competitive pay
Python C++ CI/CD PyTorch TensorFlow JAX TensorRT LLVM MLIR CUDA Docker Kubernetes Prometheus Grafana PostgreSQL Git GitHub Linux
Remote

Senior Deep Learning Software Engineer, Inference

Nvidia

Remote (Santa Clara, CA) 30 days ago $184,000–$287,500
Actively hiring Above market
C++ Python CUDA NCCL NVSHMEM OAI_TRITON CUTLASS PyTorch vLLM SGLang FlashInfer Multi-GPU_Communications Deep_Learning_Frameworks Performance_Optimization GPU_Acceleration
Remote

Senior Deep Learning Performance Architect

Nvidia

Santa Clara, CA 30 days ago $184,000–$287,500
Actively hiring Above market
Python C++ GPU ASIC Deep Learning LLM Batching KV-cache Latency/Tuning Multi-node Scaling Memory Hierarchy Scalability System Architecture Performance Tuning Profiling Debugging

Principal AI Engineer - Advanced AI (Machine Learning, Python, Deep Learning)

Target

Brooklyn Park, MN 31 days ago $168,000–$303,000
Actively hiring Competitive pay
Python, PyTorch, TensorFlow, LLM-based systems, Agentic systems, AI engineering tooling, Cloud ML platforms, Containers, Orchestration technologies, CI/CD, Version control, Code review practices, Operational monitoring, Agile principles, Prometheus, Grafana
Hybrid

Senior Deep Learning Compiler Engineer

Nvidia

Remote (Santa Clara, CA) 35 days ago $152,000–$241,500
Actively hiring Competitive pay
MLIR XLA TVM LLVM PyTorch CUDA C++ Python GPU CPU Embedded_Systems Cross_Compilation CI/CD
Remote

Senior Deep Learning Compiler Verification Engineer

Nvidia

Remote (Santa Clara, CA) 36 days ago $140,000–$224,250
Actively hiring Below market
Python C++ PyTorch JAX TensorRT LLVM MLIR TVM XLA Type Systems Program Semantics Proof-Based Verification Quantization Operator Fusion Mixed-Precision Graph-Level Optimization
Remote