Browse tech roles

Filter the feed by workplace, employment type, salary floor, and post age. For ranked matching against your resume, use AI Match.

20 of up to 20 (filtered)

Senior Data Scientist - Inference, Global Markets

Airbnb

Remote (China) 2 days ago
Actively hiring Posted this week Verified listing
SQL Python R A/B testing Causal inference Data modeling Machine learning Cloud services CI/CD Docker Kubernetes Terraform AWS PostgreSQL Git Jupyter Notebook Tableau Prometheus Grafana
Remote

Lead AI Engineer (FM Hosting, LLM Inference)

Capital One Financial

New York, NY 4 days ago $197,300 - $225,100
Actively hiring Posted this week Competitive pay
Python Docker Kubernetes AWS CI/CD Terraform PostgreSQL Redis Git Jenkins

AI/ML Technical Leader - Language Model Inference & AI Ops

Cisco

San Jose, CA 4 days ago $212,300 - $275,800
Actively hiring Posted this week Verified listing Above market
Python PyTorch TensorFlow Kubernetes CI/CD vLLM TensorRT-LLM Triton SGLang llama.cpp NVIDIA Nsight Prometheus Grafana PostgreSQL Java C++ ML lifecycle tooling Model registry Experiment tracking Observability
Hybrid

Lead AI Engineer (FM Hosting, LLM Inference)

Capital One Financial

New York, NY 4 days ago $197,300 - $225,100
Actively hiring Posted this week Competitive pay
Python Docker Kubernetes AWS CI/CD PostgreSQL Redis Git Terraform Flask React Jenkins Ansible Prometheus Grafana

Engineering Manager, Inference Benchmarking — AI Perf

Nvidia

Remote (Santa Clara, CA) 8 days ago $224,000 - $356,500
Actively hiring Verified listing Above market
Kubernetes vLLM TRT-LLM SGLang DCGM PyNVML Prometheus ZMQ Helm CI/CD Python Linux GPU TensorRT MLPerf OpenSource Docker Git GitHub MLOps
Remote

Senior ML Infrastructure Engineer, Inference Platform

General Motors (GM)

Sunnyvale, CA 8 days ago $155,420 - $205,900
Actively hiring Below market
Python Triton RayServe vLLM C++ Kubernetes Docker CI/CD Prometheus Grafana PostgreSQL Redis AWS Azure Google Cloud Platform Git Jenkins GitHub Slack Confluence Jira
Hybrid

Lead AI Engineer (FM Hosting, LLM Inference)

Capital One Financial

New York 24 days ago $197,300 - $225,100
Actively hiring Competitive pay
Python TensorFlow PyTorch Kubernetes Docker AWS CI/CD PostgreSQL Redis Git Jenkins Prometheus Grafana

Senior Deep Learning Software Engineer, Inference

Nvidia

Remote (Santa Clara, CA) 31 days ago $184,000 - $287,500
Actively hiring Above market
C++ Python CUDA NCCL NVSHMEM OAI_TRITON CUTLASS PyTorch vLLM SGLang FlashInfer Multi-GPU_Communications Deep_Learning_Frameworks Performance_Optimization GPU_Acceleration
Remote

Senior DL Algorithms Engineer - Inference Performance

Nvidia

Remote (Santa Clara, CA) 31 days ago $152,000 - $241,500
Actively hiring Competitive pay
PyTorch NVIDIA_TRT-LLM vLLM SGLang FlashInfer GPU_architecture CUDA OpenCL Deep_Learning Neural_Networks Performance_profiling HPC Computer_Architecture Python C++
Remote

Senior Software Engineer, AI Inference Systems

Nvidia

Santa Clara, CA 38 days ago $184,000 - $287,500
Actively hiring Above market
Python C/C++ CUDA Kubernetes Docker Triton PyTorch vLLM SGLang MLIR Linux Go Rust CI/CD AWS GCP Azure Prometheus Grafana GitHub MLOps
Hybrid

Compiler Engineer - AI Inference

Nvidia

Santa Clara, CA 42 days ago $152,000 - $241,500
Actively hiring Competitive pay
C/C++ Python MLIR LLVM CUDA TensorFlow PyTorch NVIDIA GPUs Datacenter Optimization CI/CD Git Linux Hardware/Software Co-Design Large Language Models (LLM)

Deep Learning Architect, LLM Inference - New College Grad 2026

Nvidia

Santa Clara, CA 44 days ago $124,000 - $195,500
Actively hiring Below market
PyTorch TRT-LLM vLLM SGLang OpenAI API MCP Python CUDA Prometheus Grafana Docker CI/CD GitHub GitLab Jupyter Notebook PostgreSQL MongoDB Kubernetes AWS Azure Google Cloud Platform

Solutions Architect, Inference Deployments

Nvidia

Santa Clara, CA 51 days ago $152,000 - $241,500
Actively hiring Competitive pay
NVIDIA_Dynamo Kubernetes TensorRT-LLM vLLM SGLang Triton_Inference_Server NVIDIA_GPU_Operator NIM_Operator MIG_Partitioning RDMA UCX Quantization Speculative_Decoding WideEP NVIDIA_Certified_AI_Engineer CI/CD

Solutions Architect, Inference Deployments

Nvidia

Santa Clara, CA 51 days ago $152,000 - $241,500
Actively hiring Competitive pay
NVIDIA_Dynamo Kubernetes TensorRT-LLM vLLM SGLang Triton_Inference_Server NVIDIA_GPU_Operator NIM_Operator MIG_Partitioning RDMA UCX Quantization Speculative_Decoding WideEP NVIDIA_TensorRT PostgreSQL CI/CD GitHub Prometheus Grafana

Senior Software Engineer - AI Inference

Nvidia

Remote (Santa Clara, CA) 52 days ago $152,000 - $241,500
Actively hiring Competitive pay
Python C++ CUDA vLLM SGLang PyTorch Triton NCCL Dynamo CI/CD GPU InfiniBand Profiling Flamegraphs Microbenchmarks Concurrency Multi-threading Multi-process Kubernetes Docker PostgreSQL
Remote

Senior Software Engineer, Machine Learning Inference

Nvidia

Santa Clara, CA 56 days ago $152,000 - $241,500
Actively hiring Competitive pay
C++ Python CUDA Rust TensorRT TensorRT-LLM vLLM SGLang PyTorch JAX Deep Learning Frameworks GPU Programming Performance Analysis Optimization Techniques CI/CD
Hybrid