IB CTO Team - Site Reliability Engineer (SRE) - Assistant Vice President

Deutsche Bank

Hybrid

Quick summary

Work type
Hybrid
Location
Cary, NC
Salary
$100,000–$153,000 / yr
Posted
37 days ago

Market check

Salary context

Below market

How this pay compares to similar roles

Similar $182k
This role $126k
$86k most similar roles pay here $227k

This role pays less than 93% of similar roles. Most pay $151,475–$211,787 — the shaded band above. At the midpoint, this role pays about $126k versus about $182k for comparable roles.

Based on 240 similar postings.

Employer

About Deutsche Bank

Deutsche Bank is a German multinational investment bank and financial services company offering corporate banking, investment banking, retail banking, asset management, and transaction banking worldwide. Industry: Investment Banking & Financial Services

Deutsche Bank currently has 28 open roles on FindRole.

Listed pay typically runs $122,500–$153,000 across 28 roles with salary data.

Most-posted roles

View all roles at Deutsche Bank

At a glance

TL;DR · IB CTO Team - Site Reliability Engineer (SRE) - Assistant Vice President

We are seeking a senior Site Reliability Engineer (SRE) to join our global team, focusing on the operational health and reliability of the CARE platform across both GCP and on-prem infrastructure. This role involves proactively monitoring and resolving issues related to availability, performance, and capacity while implementing SRE best practices such as incident response and root cause analysis. You will drive automation efforts using tools like Kubernetes, Istio, Prometheus, and Grafana, and manage deployment tooling with ArgoCD and Terraform. Additionally, you will collaborate closely with application teams to ensure compliance with security policies and operational excellence, requiring a strong background in SRE principles, GCP services, and DevOps practices.

What you'll do

  • Proactively monitor and troubleshoot platform availability, performance, and capacity.
  • Develop and maintain SRE best practices for incident response and root cause analysis.
  • Drive automation efforts to reduce manual tasks in deployment and recovery processes.
  • Define and report on Service Level Objectives (SLOs) and Indicators (SLIs).
  • Collaborate with application teams to provide guidance on platform reliability and capacity planning.
  • Ensure the platform adheres to security policies and compliance requirements.

What we're looking for

  • Strong understanding of SRE principles, including SLOs/SLIs and incident management.
  • Extensive experience with GCP services and Kubernetes for configuration, troubleshooting, and performance tuning.
  • Proficiency in monitoring tools like Prometheus, Grafana, and Google Cloud Monitoring to define effective alerts and dashboards.
  • Solid experience with Git/GitHub workflows and deployment tooling such as ArgoCD for managing application lifecycles.
  • Programming/scripting skills (Python, Go, Java, Bash) and Infrastructure as Code (Terraform) for automation and data analysis.
  • Deep knowledge of DevOps best practices, including continuous integration, delivery, and automated testing from an operational perspective.
  • Excellent problem-solving abilities to diagnose and resolve complex technical issues in distributed systems.

More like this

Similar roles

Site Reliability Engineer Lead - Senior Vice President

Citi

Remote (388 Greenwich Street - Tower, US) 3 days ago $176,720$265,080
Kubernetes OpenShift Prometheus Grafana Terraform Ansible Helm Python Java Go AWS Google Cloud Azure CI/CD Disaster Recovery Infrastructure as Code Observability SLOs SLIs Error Budgets Chaos Engineering
Remote