Senior Software Architect, Deep Learning and HPC Communications

Nvidia

Remote

Quick summary

Work type: Remote
Location: Santa Clara, CA; Westford, MA; Austin, TX; Durham, NC
Salary: $224,000–$356,500 / yr
Posted: 3 days ago

Market check

Salary context

Above market

How this pay compares to similar roles

Similar roles (midpoint): $216k
This role (midpoint): $290k
Chart range: $149k to $379k, with a shaded band marking where most similar roles pay

This role pays more than 97% of similar roles. Most pay $196,275–$235,750 — the shaded band above. At the midpoint, this role pays about $290k versus about $216k for comparable roles.

Based on 240 similar postings.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads. Industry: Semiconductors & AI Computing

Nvidia currently has 942 open roles on FindRole.

Listed pay typically runs $184,000–$287,500 across 931 roles with salary data.

At a glance

TL;DR · Senior Software Architect, Deep Learning and HPC Communications

As a Senior Software Architect at NVIDIA, you will join a team developing next-generation data center platforms and scalable communications software for AI and High Performance Computing (HPC). Your primary responsibilities include identifying performance bottlenecks in current systems, designing new communication technologies to accelerate AI and HPC workloads, and conducting proofs-of-concept and quantitative modeling. You will use C/C++, parallel programming models such as MPI and SHMEM, and high-speed interconnects such as NVLink and InfiniBand to build scalable solutions for large GPU clusters. This role requires a deep understanding of operating systems, computer architecture, and network fundamentals, along with experience in CUDA programming and familiarity with deep learning frameworks such as PyTorch and TensorFlow.

What you'll do

  • Investigate bottlenecks in current systems to enhance communication performance.
  • Design and implement new technologies to speed up AI and HPC workloads.
  • Co-design next-generation platforms by integrating GPU, networking, and software innovations.
  • Develop proofs-of-concept and conduct experiments to evaluate new solutions.
  • Simulate large-scale GPU clusters to assess performance at massive scales.

What we're looking for

  • M.S./Ph.D. in CS/CE or equivalent experience.
  • Expertise in C/C++ programming, debugging, and parallel programming models (MPI, SHMEM).
  • Deep understanding of operating systems, computer architecture, and network fundamentals.
  • Experience with Linux, CUDA programming, and NVIDIA GPUs.
  • Knowledge of high-performance networks like InfiniBand, RoCE, NVLink.

More like this

Similar roles

Senior Deep Learning Communication Architect

Nvidia

Santa Clara, CA +1 · 39 days ago · $184,000–$287,500
PyTorch, TensorRT-LLM, vLLM, SGLang, C++, Python, CUDA, OpenCL, InfiniBand, RoCE, MPI, NCCL, UCX, NVSHMEM, Dynamo, Triton, Data Parallelism, Pipeline Parallelism, Tensor Parallelism, Expert Parallelism, FSDP

Senior Software Engineer, CUDA Deep Learning Systems

Nvidia

Remote (Santa Clara, CA) +1 · 45 days ago · $184,000–$287,500
CUDA, Python, C++, PyTorch, JAX, TensorRT, vLLM, Nemo, Megatron, MaxText, Triton, XLA, NCCL, MPI, UCX, Docker, CI/CD, Git, GitHub, Linux, GPGPU, Deep Learning Frameworks, Transformer Models, Distributed Computing, Profiling Tools, CUDA Programming, Low-Level Optimization, Numerical Methods, FP8, int8, Reinforcement Learning

Senior System Software Engineer, GPU Performance

Nvidia

Remote (Santa Clara, CA) · 6 days ago · $152,000–$241,500
C/C++, Python, MPI, NCCL, UCX, NVSHMEM, Kubernetes, SLURM, Ansible, Docker, InfiniBand, Ethernet, CUDA, PyTorch, TensorFlow, RDMA, Prometheus, Grafana, CI/CD