Software Engineering Manager - GPU Communications Libraries

Nvidia

Actively hiring
Santa Clara, US Posted 15 days ago $184,000$287,500 / year

At a glance

AI generated

TL;DR

Join the GPU Communications Libraries and Networking team at NVIDIA as a technical leader managing NVSHMEM and UCX libraries, where you will lead and mentor an engineering team responsible for planning and executing projects while ensuring high-quality performance of communication libraries critical to Deep Learning and HPC applications. You will participate in feature design and implementation, collaborate with internal and external partners to define product roadmaps, and continuously improve processes and practices. Ideal candidates have over 10 years of experience in software development, specializing in HPC networking or system software, with a strong background in C/C++ programming on Linux, parallel programming models like MPI and SHMEM, and high-performance networking technologies such as InfiniBand and Ethernet.

Skills

C/C++ Linux CUDA MPI OpenMP OpenACC pthread RDMA InfiniBand RoCE Ethernet EFA NCCL NVSHMEM UCX PyTorch TensorFlow HPC ML DL

What you'll do

  • Lead and mentor a team responsible for developing communication libraries like NVSHMEM and UCX.
  • Design and implement new features in collaboration with internal teams and external partners.
  • Understand and incorporate use cases and requirements from internal and external stakeholders.
  • Continuously review and improve processes to ensure efficient execution of projects.
  • Maintain strong understanding of HPC networking, system software fundamentals, and performance optimizations.

What we're looking for

  • 10+ years of experience in software industry with specialization in HPC networking or system software.
  • 4+ years of management experience leading engineering teams.
  • BS/MS/Ph.D. in CS, CE, EE or equivalent technical field.
  • Proven track record developing complex systems software through full product lifecycle.
  • Expertise in C/C++ programming and Linux debugging.
  • Experience with parallel programming models (MPI, SHMEM) and communication runtimes (NCCL, UCX).
  • Knowledge of RDMA, high-performance networking technologies, and HPC fundamentals.

Market check

Salary context

This $184,000–$287,500 range sits above 80% of similar postings on FindRole.

Peer median band

$152,250$241,500

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$167,000$235,750

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads. Industry: Semiconductors & AI Computing

Nvidia currently has 801 open roles on FindRole.

Listed pay typically runs $184,000–$287,500 across 797 roles with salary data.

Most-posted roles

View all roles at Nvidia

More like this

Similar roles

System Software Engineer, GPU Development Tools

Nvidia

Us, Ca, Santa Clara, US 53 days ago $124,000$195,500
C++ Python CUDA DX OpenGL Vulkan Chip Simulation Virtual Machines Containers Distributed Programming Object-Oriented Design Patterns CI/CD

System Software Engineer, GPU Development Tools

Nvidia

Us, Ca, Santa Clara, US 43 days ago $152,000$241,500
C++ Python CUDA DX OpenGL Vulkan Object-Oriented_Design_Patterns Chip_Simulation System_Simulation Virtual_Machines Containers Distributed_Programming

GPU Software Engineer

Qualcomm

San Diego, Ca,Us, US 11 days ago $98,900$148,300
C C++ Python Perl GPU APIs pre-silicon tests post-silicon tests design documentation team collaboration graphics programming

Senior Staff Engineer, GPU Software Architecture

Samsung Electronics

Remote (3900 N Capital Of Texas Hwy, Austin, Tx, Usa, US) 85 days ago $180,200$297,200
C C++ Python Vulkan DirectX Metal HLSL GLSL OpenCL CUDA Unreal Unity Linux Android OpenGL 3D graphics GPU hardware ray tracing rasterization linear algebra multi-threaded debugging performance profiling parallel programming game engines offline compiler JIT compiler
Remote

Manager, Solutions Architecture – GPU and Networking Systems

Nvidia

Remote (Us, Ca, Santa Clara, US) 32 days ago $224,000$356,500
NVIDIA CUDA C/C++ Linux InfiniBand RoCE ARM-based CPUs Ethernet NICs Kubernetes Docker CI/CD Python PostgreSQL Prometheus Grafana AWS Azure Google Cloud Platform Terraform
Remote