Senior Software Architect, AI Systems and Networking

Nvidia

Remote

Quick summary

Work type
Remote
Location
Santa Clara, CA · Austin, TX
Salary
$224,000–$356,500 / yr
Posted
16 days ago

Market check

Salary context

Above market

How this pay compares to similar roles

Similar $204k
This role $290k
$141k most similar roles pay here $380k

This role pays more than 97% of similar roles. Most pay $171,387–$236,487 — the shaded band above. At the midpoint, this role pays about $290k versus about $204k for comparable roles.

Based on 240 similar postings.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads. Industry: Semiconductors & AI Computing

Nvidia currently has 855 open roles on FindRole.

Listed pay typically runs $184,000–$287,500 across 843 roles with salary data.

Most-posted roles

View all roles at Nvidia

At a glance

TL;DR · Senior Software Architect, AI Systems and Networking

As a Senior Architect in NVIDIA’s Networking Systems & Software Architecture group, you will lead the development of high-performance communication and memory management libraries for distributed AI systems. This role involves driving hardware-software co-optimization with GPU, DPU, NIC, and switch teams using technologies like GPUDirect RDMA and NVLink, while also profiling and optimizing data movement across various memory types and network fabrics. You will integrate networking capabilities into AI serving stacks such as vLLM, SGLang, and TensorRT-LLM, contribute to open-source projects, mentor engineers, and prototype experimental technologies. The ideal candidate has over 12 years of experience in systems software or networking, a strong background in high-performance networking, and expertise in C/C++/Rust programming with knowledge of ML inference frameworks and storage networking protocols.

What you'll do

  • Architect high-performance communication and memory management libraries for distributed AI.
  • Drive hardware-software co-optimization with GPU, DPU, NIC, and switch teams.
  • Profile and optimize data movement across GPU memory, system DRAM, NVMe, and network fabrics.
  • Integrate networking capabilities into AI serving stacks like vLLM and TensorRT-LLM.
  • Prototype experimental technologies to evaluate their viability in production environments.

What we're looking for

  • 12+ years of experience in systems software or networking with proven project ownership.
  • MS, PhD, or equivalent experience in Computer Science, Engineering, or related field.
  • Expertise in high-performance networking technologies like InfiniBand, RoCE, RDMA.
  • Proficient in C/C++/Rust programming for system-level development and debugging.
  • Knowledge of ML systems concepts including transformer architectures and distributed training.
  • Understanding of ML inference frameworks such as vLLM, SGLang, TensorRT-LLM.
  • Familiarity with storage networking technologies like NVMe-oF and GPUDirect Storage.

More like this

Similar roles

Senior Software Engineer, AI Networking

Nvidia

Santa Clara, CA 21 days ago $152,000$241,500
Python PyTorch TensorFlow JAX CUDA NCCL Reinforcement_Learning Bayesian_Optimization GNNs Docker Kubernetes CI/CD Prometheus Grafana Bash C++ PostgreSQL Redis

Senior Software Engineer, AI Networking

Nvidia

Austin, TX 74 days ago $184,000$287,500
C C++ RDMA verbs DPDK DOCA NCCL CUDA InfiniBand RoCE Docker Kubernetes AWS CI/CD Prometheus Grafana Python PostgreSQL

Senior Software Manager, AI Networking

Nvidia

Remote (Santa Clara, CA) 20 days ago $272,000$431,250
BlueField ConnectX Spectrum-X DOCA RDMA RoCE InfiniBand DPDK NCCL CUDA-aware networking congestion control telemetry CI/CD Kubernetes AWS GCP Azure Python Shell scripting Prometheus Grafana
Remote

Senior Solution Architect, AI Infrastructure

Nvidia

Remote (Us, Dc, Remote, US) 24 days ago $184,000$287,500
NVIDIA_GPUs NVIDIA_Networking InfiniBand Ethernet NCCL DCGM UFM Mission_Control Base_Command_Manager AI_solutions High_Performance_Computing Networking Python CI/CD Git AWS Azure Grafana Prometheus
Remote

Senior Solutions Architect, AI Infrastructure

Nvidia

Santa Clara, CA 146 days ago $184,000$287,500
NVIDIA_GPU ARM_Development C Python Embedded_Linux_Systems NCCL DCGM UFM APIs OEM_Working_Experience Industrial_Computing Military_Computing Ruggedized_Computing CI/CD