HPC Storage Systems Team Lead

Argonne National Laboratory

Lemont, Il Usa, US Posted 160 days ago $116,938$182,424 / year

At a glance

AI generated

TL;DR

As the HPC Storage Systems Team Lead at Argonne’s Leadership Computing Facility (ALCF), you will provide technical leadership and planning for disk storage and file systems supporting supercomputing resources. Your daily tasks include overseeing hardware and software components, designing future machine upgrades, and mentoring team members on emerging technologies. You’ll integrate parallel file systems like Lustre and Spectrum Scale with compute clusters using advanced networking protocols such as InfiniBand and Slingshot. This role requires deep expertise in Linux administration, storage hardware, and protocols, along with proficiency in scripting languages like Python and Bash for automation. Ideal candidates have a Master’s degree plus 6 years or Bachelor’s degree plus 10 years of experience in computer science or engineering, extensive leadership experience in multi-PB scale projects, and the ability to engage effectively with stakeholders across various domains.

Skills

Linux Python Bash Lustre Spectrum Scale NVMe-oF InfiniBand Slingshot NFS HPSS VAST WEKA DDN Spectra Logic High-speed networking HPC clusters System automation CI/CD

What you'll do

  • Lead the technical operation and maintenance of disk storage and file systems for supercomputing resources.
  • Design and plan future machine upgrades to improve disk storage and file system services.
  • Oversee design, testing, and deployment of storage solutions while mentoring team members on new technologies.
  • Integrate parallel file systems with compute clusters using advanced networking technologies like InfiniBand and Slingshot.
  • Provide strategic tuning for storage configurations, including NVMe-oF, metadata management, and tiering strategies.

What we're looking for

  • Master’s degree and 6+ years of experience in computer science or engineering.
  • Lead team to integrate storage solutions with compute clusters using advanced networking technologies.
  • Expertise in tuning configurations for NVMe-oF, metadata management, and tiering strategies.
  • Experience designing, planning, and implementing multi-PB scale storage systems.
  • Deep knowledge of Linux administration, storage hardware, protocols, and vendor solutions.
  • Proficiency in scripting languages (Python, Bash) for system automation and troubleshooting.
  • Ability to develop goals and objectives focused on the success of HPC storage team.

Market check

Salary context

This $116,938–$182,424 range sits above 32% of similar postings on FindRole.

Peer median band

$142,220$225,100

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$142,400$226,400

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Argonne National Laboratory

Argonne National Laboratory is a multidisciplinary science and engineering research center sponsored by the U.S. Department of Energy, conducting research in energy, environment, and national security. Industry: Scientific Research & National Laboratories

Argonne National Laboratory currently has 6 open roles on FindRole.

Listed pay typically runs $78,024–$121,718 across 6 roles with salary data.

Most-posted roles

View all roles at Argonne National Laboratory

More like this

Similar roles

HPC Systems Administration Specialist

Argonne National Laboratory

Lemont, Il Usa, US 157 days ago $69,750$108,810
Linux Spack Lmod Singularity Python CI pipelines Make CMake Autotools GCC Intel Compilers LLVM YAML Podman Git

HPC Systems Administration Specialist

Argonne National Laboratory

Lemont, Il Usa, US 120 days ago $69,750$108,810
Linux Spack Lmod Singularity Version control systems Compilers GCC Intel LLVM Make CMake Autotools Python CI pipelines YAML Podman MPI CUDA BLAS FFTW

Senior HPC Storage Engineer

Nvidia

Us, Ca, Santa Clara, US 67 days ago $184,000$287,500
Python Docker Ceph Weka.io Vast Lustre GPFS CUDA NCCL PyTorch TensorFlow Bash CentOS RHEL Ubuntu SDN MLPerf NVIDIA GPUs HDDs SSDs NVMe

Senior HPC Storage Architect & Engineer

Lam Research

Fremont, Ca,Us, US 135 days ago $114,000$253,000
Lustre GPFS/Spectrum Scale VAST Data WEKA NetApp ONTAP FlexCache AWS Azure GCP InfiniBand RoCE NVMe-over-Fabrics SLURM xCAT Warewulf Ansible Terraform Python YAML Kubernetes CSI S3 IaC CI/CD

HPC Application Manager

Leidos

10469 Wright Patterson Afb Oh, US 21 days ago $69,550$125,725
Linux Shell scripting ServiceNow MATLAB CMake CUDA KOKKOS PBS SLURM InfiniBand Spack GitLab Tecplot FieldView Pointwise Python PostgreSQL CI/CD

Senior Staff, IT Storage Engineer

Samsung Electronics

Remote (3655 N 1St St, San Jose, Ca, Usa, US) 9 days ago $180,200$297,200
NetAppONTAP Linux Python Ansible Terraform ONTRAPAPIO NFS XFS ZFS ext4 StorageGRID FlexGroup SnapMirror FabricPool QoS EDA HPC Lustre GPFS BeeGFS AWSFSxforNetAppONTAP AzureNetAppFiles SystemTap eBPF perf NFSv3 NFSv4 LSF Slurm GridEngine
Remote