Senior Full Stack Software Engineer - DGX Cloud

Nvidia

Remote · Actively hiring · Verified listing
Remote, US · Raleigh, NC · Austin, TX · Seattle, WA · San Jose, CA · Posted 11 days ago · $224,000–$356,500 / year

At a glance

AI generated

TL;DR

NVIDIA seeks experienced software engineers for its AI Infrastructure team to scale up GPU-based systems. You will design and develop a distributed platform to monitor and optimize GPU performance across large clusters, ensuring reliability and maximum efficiency in production environments. The role involves working with React, Web Components, TypeScript, Golang, PostgreSQL, Temporal, Bazel, and Kubernetes, and collaborating closely with cross-functional teams to improve AI workloads. Ideal candidates have 12+ years of software engineering experience, including full-stack development and shipping consumer-facing products, and a strong background in cluster management systems such as Kubernetes. Proficiency in React, TypeScript/JavaScript, Golang, and SQL databases is essential, along with expertise in asynchronous workflows and the operational excellence needed to maintain robust infrastructure.

Skills

React · TypeScript · JavaScript · Golang · PostgreSQL · Kubernetes · SQL · CI/CD · Bazel · Temporal · Slurm · Docker · Prometheus · Git · Linux · Python · GraphQL

What you'll do

  • Design and develop scalable platforms for identifying and resolving issues with non-performant GPU assets.
  • Ensure AI clusters run reliably and consistently by evaluating system failures and improving services.
  • Work across the product stack, including React, Web Components, TypeScript, Golang, PostgreSQL, and Kubernetes.
  • Collaborate with cross-functional teams and coordinate across organizational boundaries on large-scale systems.
  • Manage and automate large-scale distributed systems using tools like Kubernetes and Slurm.
  • Maintain reliable and performant infrastructure through proven operational excellence.
  • Use LLMs responsibly while understanding the risks of consuming their output blindly.

What we're looking for

  • Significant software engineering experience with large-scale production systems (12+ years).
  • Proficiency in React, TypeScript/JavaScript, and Golang.
  • Experience building full-stack consumer-facing products (6+ years).
  • Deep understanding of cluster management systems like Kubernetes and Slurm.
  • Strong operational skills for maintaining reliable infrastructure.
  • BS in Computer Science or Engineering or equivalent experience.

Market check

Salary context

This $224,000–$356,500 range sits above 98% of similar postings on FindRole.

Peer median band

$141,900–$225,100

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$144,750–$223,750

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads.

Industry: Semiconductors & AI Computing

Nvidia currently has 801 open roles on FindRole.

Listed pay typically runs $184,000–$287,500 across 797 roles with salary data.

More like this

Similar roles

Principal Software Engineer - DGX Cloud

Nvidia

Santa Clara, CA, US · 30 days ago · $272,000–$431,250
Python · Kubernetes · Go · AWS · Prometheus · Grafana · OpenTelemetry · Docker · CI/CD · Java · CUDA · cuDNN

Senior Cloud Full Stack Engineer

Humana

Remote, US · 10 days ago · $106,900–$147,000
Azure · Kubernetes · Docker · VueJS · TypeScript · NodeJs · Java · Spring · Spring Boot · JSON · RESTful APIs · Microservices · Angular · CI/CD · Terraform · PostgreSQL · MSSQL

Senior Full Stack Software Engineer

Fiserv

Sunnyvale, California, US · 12 days ago · $140,000–$210,000
Java · Spring Boot · React · TypeScript · Python · MySQL · MongoDB · Google Cloud Pub/Sub · Kubernetes · CI/CD · REST APIs · AI coding assistants · Automated testing · Code quality analysis · Documentation generation · Operational intelligence tools · Agile methods

Senior Full Stack Software Engineer

Brico

US · 27 days ago
Node.js · Python · React · Django · AWS · GCP · Azure · Git · CI/CD · PostgreSQL · Microservices · Terraform · Docker · Kubernetes · Prometheus · Grafana