Distinguished Software Architect - Deep Learning and HPC Communications

Nvidia

Actively hiring Verified listing
Santa Clara, US Posted 13 days ago $320,000$488,750 / year

At a glance

AI generated

TL;DR

NVIDIA's GPU Communications Libraries and Networking team seeks a Distinguished Software Architect to co-design next-generation data center platforms for Deep Learning and High Performance Computing applications, which demand high-speed interconnects like NVLink, PCIe, Infiniband, and Ethernet. The role involves researching new communication technologies, proposing innovative hardware and software solutions, and ensuring seamless integration with existing software stacks. You will drive the adoption of these technologies across various application domains while collaborating with internal teams and external partners to advance NVIDIA's vision in AI and HPC. Ideal candidates possess a PhD or extensive industry experience, expertise in HPC, parallel programming models, communication runtimes, GPU architecture, and high-performance networking, along with strong skills in C/C++ for systems software development and a proven track record of leadership in relevant industry standards and open-source projects.

Skills

HPC MPI NCCL NVSHMEM UCX CUDA Infiniband Ethernet C C++ PyTorch TensorFlow GPU Networking System_Architecture Parallel_Programming_Models ML_DL_Fundamentals Performance_Optimization Fault_Tolerance Competitive_Assessments

What you'll do

  • Research new communication technologies and expand the GPUDirect technology portfolio.
  • Design innovative solutions for next-gen platforms by co-working with GPU, Networking, and SW architects.
  • Drive adoption of new communication technologies across various application verticals.
  • Keep abreast of latest DL research trends and collaborate with diverse internal and external teams.
  • Conduct detailed technical analysis to inspire changes in communication library features.

What we're looking for

  • PhD in Computer Science or related field with 15+ years of experience.
  • Expertise in HPC, parallel programming models, and communication runtimes.
  • Deep understanding of high-performance networking technologies and topologies.
  • Strong knowledge in ML/DL fundamentals, fault tolerance, and performance analysis.
  • Programming fluency in C/C++ for systems software development.
  • History of industry leadership through patents, publications, and conference talks.

Market check

Salary context

This $320,000–$488,750 range sits above 100% of similar postings on FindRole.

Peer median band

$155,420$257,000

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$183,912$235,750

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads. Industry: Semiconductors & AI Computing

Nvidia currently has 801 open roles on FindRole.

Listed pay typically runs $184,000–$287,500 across 797 roles with salary data.

Most-posted roles

View all roles at Nvidia

More like this

Similar roles

Senior Software Architect - Deep Learning and HPC Communications

Nvidia

Remote (Us, Ca, Santa Clara, US) 20 days ago $184,000$287,500
C/C++ MPI NCCL NVSHMEM UCX CUDA Linux InfiniBand RoCE NVLink PyTorch TensorFlow HPC Networking Simulation Quantitative_Modeling SHMEM Parallel_Programming Deep_Learning_Pods
Remote

Senior GPU Architect, Deep Learning

Nvidia

Us, Ca, Santa Clara, US 140 days ago $184,000$287,500
C C++ Perl Python CUDA TensorFlow PyTorch NVIDIA_GPU_Architecture Deep_Learning Parallel_Computing Computer_Architecture CI/CD MESOS Kubernetes Docker Prometheus Grafana PostgreSQL Redis

Senior Deep Learning Computer Architect

Nvidia

Us, Ca, Santa Clara, US 140 days ago $184,000$287,500
C++ Python CUDA PyTorch GPU ComputerArchitecture DeepLearningKernels LLMWorkloads PerformanceAnalysis ParallelizationStrategies FusionStrategies

Senior Software Engineer, CUDA Deep Learning Systems

Nvidia

Remote (Us, Ca, Santa Clara, US) 15 days ago $184,000$287,500
CUDA Python C++ PyTorch JAX TensorRT vLLM Nemo Megatron MaxText Triton XLA NCCL MPI UCX Docker CI/CD Git GitHub Linux PostgreSQL Prometheus Grafana
Remote