Browse tech roles

Basic role filtering by workplace, salary floor, and post age. For full AI matching and advanced filtering, upload your resume using AI Match.


AI Inference Performance Engineer - New College Grad 2026

Nvidia

Santa Clara, CA 10 days ago $124,000 - $195,500
Actively hiring Verified listing Below market
Python C++ PyTorch JAX TensorRT-LLM vLLM SGLang CUDA CUTLASS cuteDSL tilelang OpenAI_Triton torch.compile MPI NCCL K8s roofline_analysis performance_profiling GPU_programming deep_learning_inference

AI/ML Technical Leader - Language Model Inference & AI Ops

Cisco

San Jose, CA 12 days ago $212,300 - $275,800
Actively hiring Verified listing Competitive pay
Python PyTorch TensorFlow Kubernetes CI/CD vLLM TensorRT-LLM Triton SGLang llama.cpp NVIDIA Nsight Prometheus Grafana PostgreSQL Java C++ ML lifecycle tooling Model registry Experiment tracking Observability
Hybrid

Lead AI Engineer (FM Hosting, LLM Inference)

Capital One Financial

New York, NY +3 12 days ago $197,300 - $225,100
Actively hiring Competitive pay
Python Docker Kubernetes AWS CI/CD PostgreSQL Redis Git Terraform Flask React Jenkins Ansible Prometheus Grafana

Engineering Manager, Inference Benchmarking — AI Perf

Nvidia

Remote (Santa Clara, CA) +1 16 days ago $224,000 - $356,500
Actively hiring Above market
Kubernetes vLLM TRT-LLM SGLang DCGM PyNVML Prometheus ZMQ Helm CI/CD Python Linux GPU TensorRT MLPerf OpenSource Docker Git GitHub MLOps
Remote

Lead AI Engineer (FM Hosting, LLM Inference)

Capital One Financial

New York +2 32 days ago $197,300 - $225,100
Actively hiring Competitive pay
Python TensorFlow PyTorch Kubernetes Docker AWS CI/CD PostgreSQL Redis Git Jenkins Prometheus Grafana

Head of GTM, AI Inference at Cloudflare

Cloudflare, Inc

San Francisco, CA +1 35 days ago
Actively hiring Verified listing
AWS Azure Google Cloud Kubernetes Docker CI/CD Prometheus Grafana Python SQL PostgreSQL MongoDB Salesforce Tableau Git GitHub Jira Confluence Scrum Agile DevOps Machine Learning AI Infrastructure GPU Technology Cloud Computing Developer Platforms Financial Modeling Data Analysis Investment Banking Management Consulting
Hybrid

Senior Software Engineer, AI Inference Systems

Nvidia

Santa Clara, CA 46 days ago $184,000 - $287,500
Actively hiring Above market
Python C/C++ CUDA Kubernetes Docker Triton PyTorch vLLM SGLang MLIR Linux Go Rust CI/CD AWS GCP Azure Prometheus Grafana GitHub MLOps
Hybrid

Compiler Engineer - AI Inference

Nvidia

Santa Clara, CA 50 days ago $152,000 - $241,500
Actively hiring Competitive pay
C/C++ Python MLIR LLVM CUDA TensorFlow PyTorch NVIDIA GPUs Datacenter Optimization CI/CD Git Linux Hardware/Software Co-Design Large Language Models (LLM)

Senior Software Engineer - AI Inference

Nvidia

Remote (Santa Clara, CA) 60 days ago $152,000 - $241,500
Actively hiring Competitive pay
Python C++ CUDA vLLM SGLang PyTorch Triton NCCL Dynamo CI/CD GPU InfiniBand Profiling Flamegraphs Microbenchmarks Concurrency Multi-threading Multi-process Kubernetes Docker PostgreSQL
Remote

Senior Director, NVIDIA AI Inference Sales

Nvidia

Santa Clara, CA 76 days ago $332,000 - $500,250
Actively hiring Above market
NVIDIA_AIMSDK NVIDIA_NeMo NVIDIA_Riva NVIDIA_NIMs NVIDIA_Triton NVIDIA_RAPIDS NVIDIA_Omniverse CUDA_X_Libraries AI_Software Cloud_Environments On_Prem_Environments Hybrid_Cloud_Environments ISVs SaaS_Sales Technical_Pre_Sales Enterprise_Workflows_Automation