Site Reliability Engineer II

CME Group

Hybrid Actively hiring
New York, NY · Chicago, IL Posted 18 days ago $93,900$156,500 / year

At a glance

AI generated

TL;DR

CME Group is hiring a Site Reliability Engineer II to join their Markets portfolio team in New York City or Chicago, focusing on the Globex trading platform. This role involves working with senior engineers and product teams to enhance observability, monitoring, and alerting for critical services while implementing AI-driven reliability solutions such as anomaly detection and predictive alerting. The SRE will also contribute to disaster recovery testing, support GCP migrations, and leverage LLMs to automate runbooks and streamline log analysis. Essential skills include Linux system experience, Python or Bash scripting, problem-solving abilities, and strong communication. Preferred qualifications are hands-on AI/ML for operations, AIOps platform expertise, cloud-based platforms like GCP, traditional observability tools, Kubernetes knowledge, networking basics, and financial markets experience in an Agile environment.

Skills

Python Bash Google Cloud Platform GCP Kubernetes Prometheus Grafana Dynatrace New Relic Moogsoft BigPanda LLMs LangChain LlamaIndex PagerDuty AIOps OpenTelemetry Splunk Linux AI ML AIOps

What you'll do

  • Implement AI-driven reliability solutions, including anomaly detection and predictive alerting in production environments.
  • Write scripts and tools to reduce operational toil and improve system velocity.
  • Participate in on-call rotation and assist with incident response under senior guidance.
  • Contribute to disaster recovery testing and improvements for systems resiliency.
  • Support the migration of markets applications to Google Cloud Platform (GCP).

What we're looking for

  • Experience with Linux-based systems and scripting skills (Python, Bash).
  • Implement AI-driven reliability solutions including anomaly detection and predictive alerting.
  • Strong problem-solving abilities and experience with cloud platforms like GCP.
  • Collaborate on disaster recovery testing and improvements in a fast-paced environment.
  • Leverage LLMs and generative AI for incident management and log analysis.
  • Programming skills to write scripts and tools for reducing operational toil.

Market check

Salary context

This $93,900–$156,500 range sits above 24% of similar postings on FindRole.

Peer median band

$120,000$198,000

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$126,562$195,000

Middle half of comparable postings.

Based on 238 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About CME Group

CME Group operates the world''s largest financial derivatives marketplace, offering futures and options products across interest rates, equity indexes, foreign exchange, energy, agricultural products, and metals. Industry: Financial Exchanges & Derivatives

CME Group currently has 11 open roles on FindRole.

Listed pay typically runs $117,050–$195,050 across 10 roles with salary data.

Most-posted roles

View all roles at CME Group

More like this

Similar roles

Site Reliability Engineer II

CME Group

Chicago - 20 S. Wacker, US 30 days ago $93,900$156,500
Google Cloud Platform Kubernetes Python Bash OpenTelemetry Splunk Prometheus Grafana Linux Distributed systems Networking(HTTP/TCP/UDP/IP) Message-oriented middleware Agile methodologies

Site Reliability Engineer |||

CME Group

Chicago - 20 S. Wacker, US 115 days ago $100,700$167,800
GCP Docker Kubernetes Python Java Oracle Postgres BigQuery SLO SLI SLA OpenTelemetry Splunk Prometheus Grafana CI/CD Bamboo JIRA Git

Staff Site Reliability Engineer

CME Group

Chicago - 20 S. Wacker, US 26 days ago $132,100$220,100
GCP Kubernetes Python Terraform ArgoCD Go Node.js CI/CD Distributed Systems Generative AI Agile PostgreSQL GitOps CICD SLI SLO Error Budgets

Site Reliability Engineer II

The Walt Disney Company

Remote (Usa - Ny - 7 Hudson Square, US) 10 days ago $123,000$165,000
AWS Kubernetes Terraform Python Go Docker CI/CD Prometheus Grafana Bash Jenkins Infrastructure-as-Code GitOps SLO/SLI Service_mesh Performance_testing Message_queues AI_assisted_development_tools
Remote

Site Reliability Engineer

The Walt Disney Company

Remote (Usa - Fl - Disney'S Hollywood Studios - Feature Animation Building, US) 50 days ago
Akamai Splunk AppDynamics GitHub Ansible Chef AWS Azure GCP CI/CD RESTful APIs Microservices Cloud computing Python JavaScript Kubernetes Terraform Prometheus Grafana
Remote

Site Reliability Engineer

Equifax

Usa - Missouri - St. Louis - Lackland, US 44 days ago
AWS GCP Terraform Jenkins Python Bash Docker Kubernetes CI/CD Prometheus PostgreSQL Linux Windows Ansible Chef