Senior Software Architect - Deep Learning and HPC Communications

Nvidia

Remote Actively hiring
Remote, USA · Santa Clara, CA · Westford, MA · Austin, TX · Durham, NC Posted 20 days ago $184,000$287,500 / year

At a glance

AI generated

TL;DR

As a Senior Software Architect on NVIDIA’s GPU Communications Libraries and Networking team, you will play a pivotal role in advancing the state-of-the-art by co-designing next-generation data center platforms and scalable communications software. Your responsibilities include identifying bottlenecks in current systems to improve communication performance, designing innovative solutions for hardware and software integration, and conducting experiments with large-scale GPU clusters. You will leverage C/C++ programming skills alongside experience with parallel programming models like MPI and SHMEM, as well as deep knowledge of operating systems and network architecture. This role demands expertise in high-performance networks such as InfiniBand and NVLink, familiarity with CUDA programming, and a solid understanding of AI and HPC workloads.

Skills

C/C++ MPI NCCL NVSHMEM UCX CUDA Linux InfiniBand RoCE NVLink PyTorch TensorFlow HPC Networking Simulation Quantitative_Modeling SHMEM Parallel_Programming Deep_Learning_Pods

What you'll do

  • Investigate bottlenecks in current systems to improve communication performance.
  • Design and implement new technologies to accelerate AI and HPC workloads.
  • Explore innovative HW and SW solutions for next-gen platforms through co-design efforts.
  • Build proofs-of-concept and conduct experiments to evaluate new innovations.
  • Use simulation to assess the performance of large GPU clusters at scale.

What we're looking for

  • M.S./Ph.D. in CS/CE or equivalent experience required.
  • Expertise in C/C++ programming and debugging skills.
  • Experience with parallel programming models (MPI, SHMEM) and communication runtimes like MPI, NCCL, NVSHMEM.
  • Deep understanding of operating systems, computer architecture, and network fundamentals for AI/HPC workloads.
  • Strong Linux proficiency and experience with high-performance networks such as InfiniBand, RoCE, NVLink.
  • Knowledge of deep learning frameworks (PyTorch, TensorFlow) and HPC applications.
  • Proven ability to collaborate effectively in a multi-national environment.

Market check

Salary context

This $184,000–$287,500 range sits above 79% of similar postings on FindRole.

Peer median band

$170,850$262,600

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$196,412$235,750

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads. Industry: Semiconductors & AI Computing

Nvidia currently has 801 open roles on FindRole.

Listed pay typically runs $184,000–$287,500 across 797 roles with salary data.

Most-posted roles

View all roles at Nvidia

More like this

Similar roles

Distinguished Software Architect - Deep Learning and HPC Communications

Nvidia

Us, Ca, Santa Clara, US 13 days ago $320,000$488,750
HPC MPI NCCL NVSHMEM UCX CUDA Infiniband Ethernet C C++ PyTorch TensorFlow GPU Networking System_Architecture Parallel_Programming_Models ML_DL_Fundamentals Performance_Optimization Fault_Tolerance Competitive_Assessments

Senior Deep Learning Computer Architect

Nvidia

Us, Ca, Santa Clara, US 140 days ago $184,000$287,500
C++ Python CUDA PyTorch GPU ComputerArchitecture DeepLearningKernels LLMWorkloads PerformanceAnalysis ParallelizationStrategies FusionStrategies

Senior Deep Learning Software Engineer

Nvidia

US 85 days ago $224,000$356,500
Python PyTorch JAX CUDA TensorRT NVIDIA_TensorRT_LLM GPU_optimization CUTLASS Triton Deep_learning_frameworks Performance_analysis GPU_architecture High_performance_computing Model_inference Inference_optimization

Senior Deep Learning Communication Architect

Nvidia

Us, Ca, Santa Clara, US 9 days ago $184,000$287,500
PyTorch TensorRT-LLM vLLM SGLang C++ Python CUDA OpenCL InfiniBand RoCE MPI NCCL UCX UCC NVSHMEM Data Parallelism Pipeline Parallelism Tensor Parallelism Expert Parallelism FSDP Disaggregated Serving Dynamo Triton