HPC Engineer

Arm Holdings

Hybrid Actively hiring
Austin, TX Posted 17 days ago $130,100–$176,000 / year

At a glance

AI generated

TL;DR

As an HPC Operations Engineer in Engineering IT at Arm, you will join the team that maintains and improves the high-performance computing platforms that engineering productivity depends on. Day to day, you will operate and improve IBM Spectrum LSF job scheduling services, build automation tools in Python and Bash to improve the user experience, and work closely with other infrastructure teams to keep service delivery reliable. You will also support cloud HPC integration across AWS, GCP, Azure, and OpenStack, and contribute to modernization initiatives involving Kubernetes and Infrastructure as Code. The role calls for strong Linux system administration and scripting skills, familiarity with monitoring tools such as Prometheus and Grafana, and a solid understanding of DevOps principles and Agile methodologies.

Skills

Python, Bash, Kubernetes, Docker, AWS, GCP, Azure, Terraform, Ansible, IBM Spectrum LSF, Prometheus, Grafana, CI/CD, DevOps, SRE, Infrastructure as Code, Slurm, Jira, Confluence

What you'll do

  • Operate and enhance IBM Spectrum LSF job scheduling services for HPC platforms.
  • Automate operational tasks and develop self-service capabilities using Python and Bash.
  • Conduct root cause analysis and restore service during production incidents in HPC environments.
  • Work with engineering users to optimize workload performance and resource utilization on HPC systems.
  • Support cloud HPC integration initiatives across AWS, GCP, Azure, OpenStack, and hybrid setups.

What we're looking for

  • Experience operating HPC environments and job schedulers like IBM Spectrum LSF.
  • Strong Linux system administration skills with RHEL or similar distributions.
  • Proficient in scripting and automation using Python, Bash, or equivalent languages.
  • Background in supporting production infrastructure and conducting root cause analysis.
  • Familiarity with monitoring tools such as Prometheus, Grafana, and observability platforms.
  • Experience with public cloud platforms (AWS, GCP, Azure) and with Kubernetes services.
  • Understanding of DevOps principles, SRE practices, and CI/CD pipeline management.

Market check

Salary context

This $130,100–$176,000 range sits above 26% of similar postings on FindRole.

Peer median band

$152,000–$241,300

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$147,625–$225,650

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Arm Holdings

Arm Holdings plc is a leading British semiconductor and software design firm, established in 1990 and recognized for developing energy-efficient processor architectures that power nearly all smartphones and a vast range of IoT and computing devices.

Arm Holdings currently has 34 open roles on FindRole.

Listed pay typically runs $184,500–$249,600 across 34 roles with salary data.

More like this

Similar roles

Senior HPC Storage Architect & Engineer

Lam Research

Fremont, CA, US 136 days ago $114,000–$253,000
Lustre, GPFS/Spectrum Scale, VAST Data, WEKA, NetApp ONTAP, FlexCache, AWS, Azure, GCP, InfiniBand, RoCE, NVMe-over-Fabrics, SLURM, xCAT, Warewulf, Ansible, Terraform, Python, YAML, Kubernetes CSI, S3, IaC, CI/CD

Senior HPC Cluster Engineer

Nvidia

Santa Clara, CA, US 79 days ago $152,000–$241,500
Slurm, Kubernetes, Python, Bash, Docker, Enroot, Prometheus, Grafana, Linux, RHEL, Ubuntu, MPI, NCCL, CUDA, NVIDIA GPUs, InfiniBand, RDMA, RoCE, Lustre, GPFS, Ansible, MLPerf

Senior HPC Performance Engineer

Nvidia

Remote (OR, US) 42 days ago $184,000–$287,500
Fortran, C, C++, OpenACC, OpenMP, MPI, CUDA, Performance analysis, Parallel programming, Linear algebra, Numerical methods, Assembly language, Debugging, Porting

High Performance Computing (HPC) Engineer

The Federal Reserve

Kansas City, MO, US 170 days ago $110,300–$155,700
Linux, Python, Bash, SLURM, Ceph, GPFS, MPI, Ansible, Salt, Puppet, GitLab CI/CD, CUDA, OpenACC, Docker, Singularity, Mesos, Kubernetes

HPC Systems Administration Specialist

Argonne National Laboratory

Lemont, IL, US 158 days ago $69,750–$108,810
Linux, Spack, Lmod, Singularity, Python, CI pipelines, Make, CMake, Autotools, GCC, Intel Compilers, LLVM, YAML, Podman, Git

HPC Systems Administration Specialist

Argonne National Laboratory

Lemont, IL, US 121 days ago $69,750–$108,810
Linux, Spack, Lmod, Singularity, Version control systems, Compilers, GCC, Intel, LLVM, Make, CMake, Autotools, Python, CI pipelines, YAML, Podman, MPI, CUDA, BLAS, FFTW