System Software Engineer, Distributed Systems

Nvidia

Quick summary

Work type: On-site
Location: Santa Clara, CA
Salary: $152,000–$241,500 / yr
Posted: 3 days ago

Market check

Salary context

Above market

How this pay compares to similar roles

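Chart: midpoint pay of about $171k for similar roles versus about $197k for this role, on a scale from $114k to $255k, with a shaded band marking where most similar roles pay.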

This role pays more than 71% of similar roles. Most pay $137,006–$204,837 — the shaded band above. At the midpoint, this role pays about $197k versus about $171k for comparable roles.

Based on 240 similar postings.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads.

Industry: Semiconductors & AI Computing

Nvidia currently has 997 open roles on FindRole.

Listed pay typically runs $168,000–$270,250 across 984 roles with salary data.

At a glance

TL;DR · System Software Engineer, Distributed Systems

The VLSI Productivity and Infrastructure team seeks a senior systems engineer to support more than 1,000 chip design engineers by building robust, long-shelf-life tools on bare-metal Linux hosts. The role involves designing and delivering core components of next-generation productivity platforms, developing reliable userspace infrastructure for large-scale engineering workflows, and enhancing orchestration around IBM LSF. Key responsibilities include coordinating state over NFS without privileged operations, converting legacy codebases to modern languages such as Go, and improving performance and reliability across Linux and Kubernetes environments. The ideal candidate has 5+ years of experience in production software development with a focus on distributed systems, strong Linux fundamentals, and hands-on experience with shared filesystems at scale and batch job scheduling.

What you'll do

  • Design and deliver core components for next-generation productivity platforms.
  • Develop reliable userspace infrastructure for long-running workflows on bare-metal Linux.
  • Build state coordination over NFS with atomicity, idempotency, and partial-write recovery.
  • Improve orchestration around IBM LSF for submission, tracking, retries, and log capture.
  • Convert legacy codebases incrementally into modern systems using Go or similar languages.

What we're looking for

  • 5+ years of experience developing and operating production software in Go or Python.
  • Strong background in Linux fundamentals, including processes, filesystems, concurrency, and debugging.
  • Expertise in building long-running automation on shared compute clusters using batch schedulers.
  • Ability to translate high-level goals into a safe delivery plan with measurable outcomes.
  • Experience with distributed systems, focusing on failures, retries, backoff strategies, and idempotency.
  • Hands-on experience with NFS or coordination patterns on eventually-consistent storage at scale.

More like this

Similar roles

Principal Software Engineer, Distributed Systems

Alteryx

Remote (Northern California, USA) · 20 days ago · $215,000–$300,000
Kubernetes, Java, Python, Node.js, Kafka, Redis, API design, Docker, AWS, Azure, GCP, Terraform, CI/CD, Prometheus, Grafana, GitOps, Service Mesh, Observability, SRE, DevOps, Scalability, Security, Architecture Review Board