Infrastructure Systems Engineer

Nvidia

Hybrid

Quick summary

Work type: Hybrid
Location: Santa Clara, CA
Salary: $124,000–$195,500 / yr
Posted: 3 days ago

Market check

Salary context

Competitive pay

How this pay compares to similar roles

[Chart: pay scale from $115k to $207k. Similar roles midpoint: $165k. This role midpoint: $160k. A shaded band marks where most similar roles pay.]

About 55% of similar roles pay more than this one. Most pay $132,372–$197,062 (the shaded band above). At the midpoint, this role pays about $160k versus about $165k for comparable roles.

Based on 240 similar postings.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads.

Industry: Semiconductors & AI Computing

Nvidia currently has 950 open roles on FindRole.

Listed pay typically runs $184,000–$287,500 across 939 roles with salary data.

At a glance

TL;DR · Infrastructure Systems Engineer

NVIDIA’s Kernel Infrastructure team seeks a hands-on systems engineer to manage environment readiness and long-term health for next-generation GPU platforms. This role involves early production bring-up and tuning, triaging complex issues, fleet health monitoring, and standardization of system baselines. The ideal candidate will have 3+ years in systems engineering with expertise in Linux and Windows administration, scripting (Shell, Python), and automation tools like Ansible. Proficiency with Slurm or Kubernetes is preferred, along with strong communication skills to collaborate across teams. Experience managing HPC clusters at scale and configuring early hardware prototypes is a plus, as is a background in Computer Engineering or related fields.

What you'll do

  • Drive early-stage engineering systems to a performance-ready state through firmware/VBIOS flashing and system tuning.
  • Triage complex system issues by coordinating directly with firmware, hardware design, and platform teams.
  • Monitor and optimize fleet health by implementing proactive checks and manual recoveries when needed.
  • Establish and maintain "golden" system baselines for stable engineering execution as products evolve.
  • Manage hardware inventory and allocation to improve utilization across engineering teams.

What we're looking for

  • 3+ years of experience in systems engineering or infrastructure operations with early-stage platforms.
  • Deep expertise in Linux and Windows system administration with strong debugging skills across the hardware-to-software stack.
  • Proficiency in scripting languages such as Shell and Python, and in automation tools such as Ansible.
  • Hands-on experience managing HPC clusters at scale and configuring bring-up systems for early hardware prototypes.
  • Strong problem-solving abilities and a collaborative mindset for cross-functional team coordination.
  • Degree in Computer Engineering, Electrical Engineering, or related field; equivalent experience also considered.

More like this

Similar roles

Infrastructure Systems Engineer

Lockheed Martin

Moorestown, NJ · 3 days ago · $75,000–$135,961
Linux, Red Hat Enterprise Linux, TCP/IP, Multicast, VLANs, Routing Protocols, Network Switches, Routers, SELinux, iptables, FreeIPA, Bash, Python, Ansible, Kubernetes, Docker, CI/CD, AWS, Azure, GCP, Git, Jenkins, Terraform, PostgreSQL, MongoDB

Infrastructure Systems Engineer

General Dynamics

Pittsfield, MA · 3 days ago · $100,219–$111,180
IBM DOORS, MATLAB, Simulink, Windows PowerShell, Cisco Networking, Linux, Kubernetes, MongoDB, Virtualization, Containerization, Cybersecurity Tools, Database Administration
Hybrid

Infrastructure Engineer

Invenergy

Chicago · 8 days ago · $137,000–$160,000
Microsoft 365, PowerShell, Terraform, Bicep, Azure, Intune, ServiceNow, CI/CD, Zero Trust Architecture, NERC-CIP, AI-Assisted Tools, PostgreSQL, MySQL, MSSQL, Kubernetes, Docker, Prometheus, Grafana

Infrastructure Engineer

Berkeley Research Group

Remote · 2 days ago · $110,000–$160,000
Azure, PowerShell, Active Directory, RBM, Azure AD, Intune, SCCM, Exchange Online, Teams, SharePoint Online, Jamf, Group Policy, DNS, DHCP, Windows Server, Azure Virtual Machines, Azure Networking, Azure Monitor, Log Analytics, Terraform, Git, CI/CD
Remote