Senior Incident & Automation Engineer (AIOps / Reliability) Vice President

Citi

Actively hiring
Irving, TX Posted 28 days ago $125,760–$188,640 / year

At a glance

AI generated

TL;DR

The Senior Incident & Automation Engineer role within the Technology team focuses on reducing operational noise and enhancing intelligent event management in large-scale enterprise environments. This position involves analyzing incident patterns to identify root causes, designing rules for event correlation and automation, and developing self-healing capabilities for infrastructure incidents. The engineer will also collaborate with cross-functional teams to ensure comprehensive observability and continuously validate the effectiveness of implemented solutions. Required skills include deep expertise in enterprise infrastructure, proficiency in modern AIOps platforms, experience with scripting languages for automation, and strong data analysis abilities using query languages. Candidates should have at least 8 years of hands-on experience in IT operations or system architecture within large-scale environments, along with proven success in event management initiatives.

Skills

AIOps, Kubernetes, Terraform, Python, Shell, Ansible, Prometheus, Grafana, PostgreSQL, ELK Stack, CI/CD, Docker, AWS, Azure, Google Cloud Platform, ITIL, SRE, Infrastructure as Code, Logstash, Elasticsearch, Kibana

What you'll do

  • Conduct comprehensive analysis of alert and incident patterns to identify root causes.
  • Design and optimize rules for event correlation on AIOps platforms.
  • Develop automation playbooks for incident data enrichment and self-healing capabilities.
  • Assess observability across infrastructure domains to propose enhancements.
  • Continuously validate the effectiveness of implemented rules and automation.

What we're looking for

  • At least 8 years of hands-on experience in IT operations and infrastructure engineering.
  • Proven success leading event management and incident reduction initiatives with quantifiable results.
  • Deep understanding of enterprise infrastructure including virtualization, container orchestration, microservices, and storage architectures.
  • Expertise with monitoring tools for compute, virtualization, storage, and cloud platforms.
  • Hands-on experience developing automation solutions using scripting languages and modern frameworks.
  • Proficiency in log analysis, pattern recognition, and data analysis on log aggregation platforms.
  • Excellent analytical skills and systematic approach to troubleshooting complex issues.

Market check

Salary context

This $125,760–$188,640 range sits above 49% of similar postings on FindRole.

Peer median band

$125,760–$198,000

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$133,900–$192,025

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Citi

Citi is one of the world’s most trusted financial institutions, proudly serving millions of customers across the United States.

Citi currently has 336 open roles on FindRole.

Listed pay typically runs $125,760–$188,640 across 308 roles with salary data.

More like this

Similar roles

Site Reliability Engineer Lead - Senior Vice President

Citi

Remote (388 Greenwich Street - Tower, US) 50 days ago $176,720–$265,080
Kubernetes OpenShift Prometheus Grafana Terraform Ansible Helm Python Java Go AWS Google Cloud Azure CI/CD Disaster Recovery Infrastructure as Code Observability SLOs SLIs Error Budgets Chaos Engineering

Service Reliability Engineer - Assistant Vice President

Deutsche Bank

Cary, 3000 Centregreen Way, US 94 days ago $100,000–$153,000
GCP Terraform Kubernetes Docker CI/CD Python Java NodeJS Shell GitHub Actions GitLab Ansible UCD Helm Azure AWS PostgreSQL Oracle RabbitMQ Redis Distributed Systems Observability

Engineering Lead Analyst - Vice President

Citi

6400 Las Colinas Blvd Irving, US 59 days ago
Python TypeScript React MongoDB Generative AI Docker Kubernetes CI/CD PyTest Playwright DeepSpeed vLLM GPTQ Node.js GraphQL

QA Automation Engineer – Assistant Vice President

Citi

Remote (6400 Las Colinas Blvd Irving, US) 28 days ago $107,120–$160,680
Selenium WebDriver Playwright Cucumber Postman SoapUI Rest Assured Jenkins GitHub SQL CI/CD Karate Zephyr Splunk Jira

Automation Test Engineer- Vice President

Citi

Remote (3800 Citigroup Center Drive Building B Tampa, US) 71 days ago $103,920–$155,880
Java Spring Boot Rest Assured Cucumber BDD Oracle DB MongoDB SQL NoSQL Jenkins GitLab CI CI/CD JSON XML Selenium

ML Operations Engineer - Associate Vice President

Citi

Remote (6400 Las Colinas Blvd Irving, US) 66 days ago $107,120–$160,680
Python MLflow Ray Tune Kubernetes Docker CI/CD Apache Spark Apache Iceberg FLINK Kafka PostgreSQL Oracle MongoDB Prometheus Grafana Terraform Apache Airflow