System Software Engineer, Distributed Systems

Nvidia

Actively hiring
Santa Clara, CA Posted 64 days ago $152,000$241,500 / year

At a glance

AI generated

TL;DR

The VLSI Productivity and Infrastructure team seeks a versatile systems engineer to support over 1000 chip design engineers by building robust, long-shelf-life tools on bare-metal Linux hosts. This role involves designing and delivering core components of next-generation productivity platforms, developing reliable userspace infrastructure for large-scale engineering workflows, and enhancing orchestration around IBM LSF. Key responsibilities include state coordination via NFS without privileged operations, converting legacy codebases to modern languages like Go through incremental migration, and improving performance and reliability across Linux and Kubernetes environments. The ideal candidate has 5+ years of experience in production software development with a strong background in distributed systems, Linux fundamentals, and operational excellence, along with hands-on experience in shared filesystems at scale and batch job scheduling.

Skills

Go Python Linux NFS IBM LSF Docker Kubernetes Perl CI/CD Prometheus Grafana Git Bash SQL Redis Zookeeper Consul Elasticsearch Jenkins Ansible Terraform AWS Google Cloud Platform Azure

What you'll do

  • Design and deliver core components for next-generation productivity platforms.
  • Develop reliable userspace infrastructure for long-running workflows on bare-metal Linux.
  • Build state coordination over NFS with atomicity, idempotency, and partial-write recovery.
  • Enhance orchestration around IBM LSF for submission, tracking, retries, and log capture.
  • Convert legacy codebases incrementally into modern systems using Go, ensuring observability.
  • Debug and improve performance and reliability across Linux and Kubernetes environments.
  • Translate high-level goals into safe delivery plans with instrumentation and staged rollouts.

What we're looking for

  • B.S. in Computer Science or Electrical Engineering (or equivalent experience)
  • 5+ years developing and operating production software in Go and/or Python
  • Strong Linux fundamentals including processes, filesystems, permissions, concurrency
  • Experience building long-runtime automation on shared compute clusters
  • Solid distributed-systems thinking with focus on failures and operational rigor
  • Ability to translate high-level goals into a safe delivery plan with measurable outcomes
  • Hands-on experience with shared filesystems at scale (NFS) and batch job scheduling

Market check

Salary context

Competitive pay

How this pay compares to similar roles

Similar $178k
This role $197k
$111k most similar roles pay here $255k

This role pays more than 65% of similar roles. Most pay $142,400–$214,500 — the shaded band above. At the midpoint, this role pays about $197k versus about $178k for comparable roles.

Based on 240 similar postings.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads. Industry: Semiconductors & AI Computing

Nvidia currently has 824 open roles on FindRole.

Listed pay typically runs $184,000–$287,500 across 812 roles with salary data.

Most-posted roles

View all roles at Nvidia

More like this

Similar roles

Principal Software Engineer, Distributed Systems

Alteryx

Remote (Northern California, Usa - Remote, US) 5 days ago $215,000$300,000
Kubernetes Java Python Node.js Kafka Redis API design Docker AWS Azure GCP Terraform CI/CD Prometheus Grafana GitOps Service Mesh Observability SRE DevOps Scalability Security Architecture Review Board
Remote

Systems Software Engineer

Danaher Corporation

Vista, CA 26 days ago $84,000$120,000
AI Python CMake Linux DevOps CI/CD Configuration Management Agile Scrum FDA ISO Design Control Quality Management System

Software and Systems Engineer

Booz Allen Hamilton

Chantilly, VA 68 days ago $69,400$158,000
Agile Jira Confluence Visio Cloud software development Risk management processes Requirements traceability Atlassian tools MBSERequirements traceability

Software Systems Engineer

Broadcom

CA 91 days ago $141,300$226,000
Kubernetes Docker Go C++ Python Git CI/CD Terraform AWS Azure GCP Prometheus Grafana PostgreSQL Redis MongoDB GraphQL REST Swagger OAuth JWT

Lead Software Systems Engineer

Boeing

Tukwila, WA 15 days ago $171,700$232,300
Python C++ Java MIL-STD-6016 Link-16 MIDS TTNT TDMA JTRS IER Agile CI/CD Tactical Data Links Command and Control Systems Software Development Lifecycle Quality Assurance

Software and Systems Engineer, Mid

Booz Allen Hamilton

Chantilly, VA 7 days ago $69,400$158,000
Agile Scrum Jira Confluence Visio Terraform AWS Kubernetes Docker Python CI/CD PostgreSQL Git GitHub Prometheus Grafana