Site Reliability Engineer III

JPMorgan Chase

Quick summary

Work type
On-site
Location
Jersey City, NJ; Dallas, TX
Salary
$133,000–$185,000 / yr
Posted
14 days ago

Market check

Salary context

Competitive pay

How this pay compares to similar roles

[Chart: pay scale from $123k to $228k, with a shaded band where most similar roles pay; similar roles at $173k, this role at $159k.]

About 65% of similar roles pay more than this one. Most pay $142,400–$203,875 (the shaded band above). At the midpoint, this role pays about $159k versus about $173k for comparable roles.

Based on 240 similar postings.

Employer

About JPMorgan Chase

JPMorgan Chase & Co. is a global financial services firm and one of the largest banks in the world, offering investment banking, commercial banking, asset management, and consumer financial services.

JPMorgan Chase currently has 436 open roles on FindRole.

Listed pay typically runs $152,000–$215,000 across 230 roles with salary data.

At a glance

TL;DR · Site Reliability Engineer III

As a Site Reliability Engineer III at JPMorgan Chase within the Chief Data & Analytics Office AI/ML & Data Platforms team, you will play a pivotal role in maintaining and optimizing mission-critical systems. Your responsibilities include configuring, monitoring, and improving cloud infrastructure using AWS, Databricks, Snowflake, and Kubernetes, while collaborating with software engineers to implement automated CI/CD pipelines. You will leverage Python or PySpark for AI/ML automation and use tools like Grafana, Prometheus, and Splunk for observability. Additionally, you will apply enterprise-authorized AI capabilities to enhance incident resolution and proactively address issues before they impact customers, ensuring the reliability and scalability of applications through rigorous testing and disaster recovery practices.

What you'll do

  • Implement infrastructure, configuration, and network as code for applications and platforms.
  • Use AI capabilities to accelerate incident triage, troubleshooting, and post-incident analysis in compliance with security requirements.
  • Apply AI to identify patterns indicating reliability risk or recurring issues, prioritizing improvements tied to SLO outcomes.
  • Develop and support AI/ML solutions for incident resolution and intelligent automation for troubleshooting.
  • Validate AI-assisted operational recommendations before applying changes, ensuring compliance with data sensitivity and company standards.

What we're looking for

  • Proficient in site reliability engineering principles and implementing SRE within applications, including SLI/SLO/SLA understanding.
  • Experience with AWS Cloud, Databricks, Snowflake, Kubernetes, and continuous integration/delivery tools.
  • Hands-on experience in system design, resiliency, testing, operational stability, and disaster recovery.
  • Proficient in at least one programming language (Python, Java/Spring Boot, .NET) for AI/ML modeling and automation.
  • Working knowledge of using enterprise-authorized AI capabilities to support SRE workflows with data sensitivity awareness.
  • Experience with observability tools such as Grafana, Dynatrace, Prometheus, Datadog, and Splunk for monitoring and alerting.
  • 4+ years in an SRE or production support role with experience running production incident calls.

More like this

Similar roles

Site Reliability Engineer III

Electronic Arts

Kirkland, WA +1 24 days ago $114,300–$156,200
AWS Terraform Kubernetes Python OpenSearch Elasticsearch Prometheus Grafana CI/CD GitLab Vault OIDC Venafi Rust Redis Valkey kubectl Lens S3 SQS IAM Route53
Hybrid

Site Reliability Engineer III

Electronic Arts

Kirkland, WA +1 15 days ago $114,300–$156,200
Kubernetes AWS Terraform Python Prometheus Grafana OpenSearch Bash PowerShell Linux Windows
Hybrid

Site Reliability Engineer III

Electronic Arts

Hyderabad, Telangana, India 14 days ago
AWS Kubernetes Terraform Docker CI/CD Prometheus Grafana Python Linux Unix Networking Helm Ansible Bash Java SQL NoSQL
Hybrid

Site Reliability Engineer

Equifax

St. Louis, Missouri +1 73 days ago
AWS GCP Terraform Jenkins Python Bash Docker Kubernetes CI/CD Prometheus PostgreSQL Linux Windows Ansible Chef
Hybrid

Site Reliability Engineer

Shopify

Europe 58 days ago
Kubernetes Docker CI/CD Python Go PostgreSQL AWS GCP Prometheus Grafana Terraform GitOps