HPC Engineering Intern (Hybrid) - WashU IT

Washington University in St. Louis

Remote · Hybrid · Verified listing
Remote, USA · 4480 Clayton, US · Posted 11 days ago

At a glance

AI generated

TL;DR

As an HPC Engineering Intern (DevOps) with WashU IT, you will join a team that runs large-scale computing infrastructure. You will automate and harden cluster provisioning with Ansible, build custom Datadog dashboards to track system health, refine operational procedures for clarity and accuracy, test and debug code so HPC workflows run reliably, and help integrate AI algorithms into existing systems. You will work closely with engineers, receive mentorship, and document your work. Key skills include Ansible, Bash scripting, Python, GitHub, and Datadog. The internship offers hands-on experience with real operational problems in a high-performance computing environment.

Skills

Ansible, Python, Bash, GitHub, Datadog, CI/CD, Kubernetes, Docker, Terraform, PostgreSQL, Prometheus, Grafana

What you'll do

  • Convert legacy scripts into Ansible playbooks for Infrastructure as Code.
  • Create custom dashboards in Datadog to surface system health metrics.
  • Update and improve operational procedures and runbooks for clarity and accuracy.
  • Validate and troubleshoot code and automation for reliable HPC workflows.
  • Assist with integrating AI algorithms into existing systems and CI/CD pipelines.

What we're looking for

  • Experience with Ansible for automating and provisioning infrastructure.
  • Proficiency in Bash scripting and Python programming.
  • Ability to develop custom dashboards using Datadog for monitoring purposes.
  • Strong analytical skills and independent problem-solving capabilities.
  • Excellent communication and documentation skills for operational procedures.
  • Familiarity with GitHub for version control and collaboration on projects.
  • Experience in performance monitoring and data analysis.

Market check

Salary context

This listing doesn't show a salary. Similar roles on FindRole typically pay $130,750–$225,000.

Peer median band

$130,750–$225,000

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$148,250–$213,931

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Washington University in St. Louis

Washington University in St. Louis is a private research university known for excellence in medicine, law, business, engineering, and the arts, affiliated with one of the nation's top medical schools.

Industry: Higher Education & Research

Washington University in St. Louis currently has 2 open roles on FindRole.

More like this

Similar roles

HPC Engineer

Arm Holdings

Austin, Texas, US · 17 days ago · $130,100–$176,000
Python, Bash, Kubernetes, Docker, AWS, GCP, Azure, Terraform, Ansible, IBM Spectrum LSF, Prometheus, Grafana, CI/CD, DevOps, SRE, Infrastructure as Code, Slurm, Jira, Confluence

High Performance Computing (HPC) Engineer

The Federal Reserve

Kansas City, MO, US · 170 days ago · $110,300–$155,700
Linux, Python, Bash, SLURM, Ceph, GPFS, MPI, Ansible, Salt, Puppet, GitLab, CI/CD, CUDA, OpenACC, Docker, Singularity, Mesos, Kubernetes

HPC Systems Administration Specialist

Argonne National Laboratory

Lemont, IL, US · 121 days ago · $69,750–$108,810
Linux, Spack, Lmod, Singularity, Version control systems, Compilers, GCC, Intel, LLVM, Make, CMake, Autotools, Python, CI pipelines, YAML, Podman, MPI, CUDA, BLAS, FFTW

HPC Systems Administration Specialist

Argonne National Laboratory

Lemont, IL, US · 158 days ago · $69,750–$108,810
Linux, Spack, Lmod, Singularity, Python, CI pipelines, Make, CMake, Autotools, GCC, Intel Compilers, LLVM, YAML, Podman, Git

Senior HPC Performance Engineer

Nvidia

Remote (Us, Or, Remote, US) · 42 days ago · $184,000–$287,500
Fortran, C, C++, OpenACC, OpenMP, MPI, CUDA, Performance analysis, Parallel programming, Linear algebra, Numerical methods, Assembly language, Debugging, Porting
Remote

Senior HPC Cluster Engineer

Nvidia

Santa Clara, CA, US · 79 days ago · $152,000–$241,500
Slurm, Kubernetes, Python, Bash, Docker, Enroot, Prometheus, Grafana, Linux, RHEL, Ubuntu, MPI, NCCL, CUDA, NVIDIA GPUs, InfiniBand, RDMA, RoCE, Lustre, GPFS, Ansible, MLPerf