Lead Systems Software Test Engineer, CSP Engagements

Nvidia

Quick summary

Work type
On-site
Location
Santa Clara, CA
Salary
$184,000–$287,500 / yr
Posted
3 days ago

Market check

Salary context

Above market

How this pay compares to similar roles

Similar roles: $190k
This role: $236k
Chart range: $137k to $304k (shaded band: where most similar roles pay)

This role pays more than 92% of similar roles. Most pay $168,240–$211,200 — the shaded band above. At the midpoint, this role pays about $236k versus about $190k for comparable roles.

Based on 240 similar postings.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads. Industry: Semiconductors & AI Computing

Nvidia currently has 942 open roles on FindRole.

Listed pay typically runs $184,000–$287,500 across 931 roles with salary data.

At a glance

TL;DR · Lead Systems Software Test Engineer, CSP Engagements

NVIDIA is hiring a Lead Systems Software Test Engineer to join its Cloud Service Provider Engagements team, focusing on validating the ML software stack for datacenter products like GB200 and Vera Rubin. This role involves defining test strategies, reproducing customer bugs, and collaborating with development teams to ensure release readiness through comprehensive validation from concept to deployment. The ideal candidate will have experience in system testing, QA, and platform bring-up for complex hardware-software systems, along with strong skills in Linux environments, shell scripting, Python automation, and CI workflows. Familiarity with cloud and cluster-level deployments, deep learning workloads, and ML Ops is a plus, as the role requires managing large datasets, developing tooling for efficient debugging, and working closely with CSP teams to ensure environment readiness and performance benchmarking.

What you'll do

  • Define test strategies and validation plans for CSP integration milestones.
  • Reproduce, characterize, and triage customer bugs in their environment.
  • Validate fixes and release updates against deployed CSP software modules.
  • Partner with NVIDIA development teams to drive root-cause analysis and confirm release readiness.
  • Manage large datasets of testing output and develop tooling for efficient retrieval and reporting.
  • Work with customers to localize problems using targeted reproduction steps for stress and edge-case testing.
  • Run performance benchmarks for training and inference, collaborating on validation for customer issues.

What we're looking for

  • Extensive experience in validating complex hardware-software systems for datacenter products.
  • Deep understanding of server platforms, firmware, drivers, OS integration, and large-scale clusters.
  • Proficient in debugging issues across multiple layers including hardware, firmware, software, and networking.
  • Strong skills in analyzing logs, telemetry, diagnostic outputs, and system health signals.
  • Hands-on experience with Linux environments, shell scripting, Python automation, and CI workflows.
  • Ability to create test plans, regression suites, validation reports, and defect documentation.
  • BS or MS in Computer Engineering, Computer Science, or related field (or equivalent experience).
  • Experience in cloud deployment, cluster-level operations, and deep learning workload automation.

More like this

Similar roles

Lead Software Systems Engineer

Boeing

Kent, WA 4 days ago $158,100–$213,900
Java C++ Agile SAFe Maven Git Jenkins DOORS MBSE MSOSA Cameo Jira Confluence Linux Windows Requirements Management Configuration Management Test Management Embedded Systems Sensor Systems Avionics Command & Control Architectures

Software Systems Engineer, Senior/Lead

Boeing

El Segundo, CA 3 days ago $158,500–$213,900
Python Java C++ Kubernetes Docker GitLab CI/CD SQL NoSQL Kafka ActiveMQ CCSDS DevSecOps Agile AWS PostgreSQL Messaging Systems Data Streaming Technologies

Lead Software Integration Engineer

General Dynamics

Manassas, VA 4 days ago $95,384–$105,817
Linux CUDA CI/CD AI Python C++ GPU Kubernetes Docker Git Jenkins PostgreSQL SonarQube Prometheus Grafana

Lead Software Engineer

The Walt Disney Company

Remote 68 days ago $152,200–$204,100
AWS Kafka RabbitMQ .NET Core .NET 6+ Spring Boot SQL RESTful APIs OAuth 2.0 C# Java PostgreSQL S3 RDS VPC Docker CI/CD Snowflake Lambda Prometheus
Remote

Lead Software Engineer

T. Rowe Price

Owings Mills, MD +6 67 days ago $145,000–$247,000
AWS Java Python JavaScript DevOps CI/CD Terraform Docker Kubernetes Informatica Apache Spark PostgreSQL SQL Git Jenkins Ansible Prometheus Grafana
Hybrid

Lead Software Engineer

The Walt Disney Company

Glendale, CA +1 27 days ago $155,700–$208,700
Java SpringBoot DynamoDB Redis Apache Kafka AWS Terraform Docker Kubernetes Microservice architecture CI/CD Prometheus Grafana
Hybrid