Head of High Availability Systems Engineering

Citi

Remote

Quick summary

Work type
Remote
Location
Remote
Salary
$170,000–$300,000 / yr
Posted
2 days ago
Closes
Jun 30, 2026

Market check

Salary context

Above market

How this pay compares to similar roles

Similar $187k
This role $235k
$130k most similar roles pay here $318k

This role pays more than 80% of similar roles. Most pay $150,056–$223,700 — the shaded band above. At the midpoint, this role pays about $235k versus about $187k for comparable roles.

Based on 239 similar postings.

Employer

About Citi

Citi is one of the world’s most trusted financial institutions, proudly serving millions of customers across the United States.

Citi currently has 397 open roles on FindRole.

Listed pay typically runs $125,760–$188,640 across 367 roles with salary data.

Most-posted roles

View all roles at Citi

At a glance

TL;DR · Head of High Availability Systems Engineering

As the Head of High Availability Systems Engineering at Citi, you will lead a globally distributed team responsible for maintaining and evolving mission-critical infrastructure that supports enterprise-grade operations. Your day-to-day involves overseeing the installation, configuration, and maintenance of z/TPF, Stratus VOS, and I-Series IOS environments, while also driving performance optimization and capacity planning to ensure system reliability. You will champion security policies, manage disaster recovery solutions, and collaborate with cross-functional teams to enhance operational efficiency. The role requires deep expertise in mainframe technologies such as z/TPF administration, IBM Security Portal, and storage replication, alongside strong leadership skills for talent development and strategic visioning. This position is ideal for someone passionate about high-stakes engineering challenges within a large-scale financial institution.

What you'll do

  • Lead and mentor a global team of z/TPF, Stratus, and I-Series Systems Programmers.
  • Oversee installation, configuration, maintenance, and upgrades of large-scale HA infrastructure.
  • Proactively monitor system performance to identify and resolve bottlenecks for reliability.
  • Champion comprehensive security policies and maintain compliance with regulatory standards.
  • Architect and maintain enterprise-grade disaster recovery solutions for critical systems.
  • Collaborate with cross-functional teams to deliver integrated, resilient technology solutions.

What we're looking for

  • At least 8 years of hands-on z/TPF Administration experience in large-scale production environments.
  • Proven track record of leading and scaling global engineering teams in high-stakes environments.
  • Minimum 15+ years of overall experience in systems engineering or infrastructure roles.
  • Expert-level understanding of disaster recovery architectures and storage replication technologies.
  • Demonstrated proficiency with Stratus VOS and I-Series IOS operating systems and associated toolsets.
  • Strong working knowledge of mainframe hardware, software ecosystems, including IBM Security Portal.

More like this

Similar roles

Director, Reliability Engineering

Johnson & Johnson

Remote (Us328 Ca Santa Clara - 5490 Great America Pkwy, US) 18 days ago $172,000$297,850
Electrical Engineering Mechanical Engineering R&D Engineering Reliability Engineering IEC60601 IEC61010 21CFR820 ISO13485 Robotics Hardware Design Root Cause Analysis Risk Management Process Control Product Development Lifecycle
Remote

Senior Reliability Engineer

JLL (Jones Lang LaSalle)

NJ 53 days ago $140,000$160,000
Excel CMMS EAM ISO9001 ISO55001 RCM CbM PdM BAS PLC SCADA SQL Python R Tableau Building Automation Systems Energy Management Platforms Vibration Analysis Oil Analysis Infrared Thermography Ultrasound Motor Current Analysis CMMS/EAM systems Microsoft Excel

Senior Reliability Engineer

Anduril Industries

Atlanta, GA 2 days ago $144,000$191,000
MIL-HDBK-217 MIL-HDBK-472 MIL-STD-810 MIL-STD-461 MIL-STD-516C MIL-STD-1629 DO-254 DO-178 FMEA Fault Tree Analysis Weibull analysis Predictive reliability analysis Qualification Test plans Reliability Engineering framework Highly Accelerated Life Testing Environmental Testing MIL-HDBK-217Plus HALT HASS

Senior Reliability Engineer

Medtronic

Mounds View South, MN 3 days ago $107,200$160,800
ISO_14971 ISO_13485 FMEA CAPA DFMEA PSUR Risk_Management_Report Microsoft_Word Microsoft_Excel Microsoft_PowerPoint Design_Control Root_Cause_Analysis Statistical_Data_Analysis CI/CD
Hybrid

Manager, Reliability Engineering

Johnson & Johnson

Remote (Us160 Nj Raritan - 1003 Us Highway 202 N, US) 10 days ago $102,000$175,950
AWS GCP Azure Terraform CI/CD Prometheus Grafana Jenkins xRay JFrog Artifactory New Relic Splunk Python JavaScript HTML CSS Drupal .NET SharePoint React Angular Vue DevOps Kubernetes Docker Git PostgreSQL
Remote

Senior Staff Engineer - System

Samsung Semiconductor

San Jose, CA 2 days ago $189,000$301,000
C C++ Python MATLAB Jira Gerrit Git GNSS CDMA AGC DSP filters correlators FFT AFC PLL signal processing detection theory estimation theory wireless channels random processes demodulation decoding