Site Reliability Engineering (SRE) Manager

IBM

Quick summary

Work type
On-site
Location
Research Triangle Park, NC
Posted
37 days ago

Market check

Salary context

How this pay compares to similar roles

Similar $179k
$130k most similar roles pay here $230k

This listing doesn't post a salary. Most similar roles pay $142,450–$215,462.

Based on 238 similar postings.

Employer

About IBM

IBM is a US-based global technology company providing hybrid cloud, AI, consulting, enterprise software, and IT infrastructure products and services.

IBM currently has 743 open roles on FindRole.

Listed pay typically runs $1,000,000–$1,000,000 across 8 roles with salary data.

Most-posted roles

View all roles at IBM

At a glance

TL;DR · Site Reliability Engineering (SRE) Manager

The Site Reliability Engineer (SRE) Manager position at IBM’s CISO Platform team involves leading a high-performing SRE group to ensure the internal security platforms maintain top-tier performance, resilience, and compliance. This role entails overseeing operational processes, infrastructure automation, monitoring, and incident response while ensuring adherence to security standards and regulatory requirements. Daily tasks include driving team execution, collaborating with cross-functional teams, and fostering a culture of accountability and innovation. The ideal candidate has experience managing engineering or SRE teams, delivering reliable services, and automating infrastructure tasks. Key skills include expertise in Kubernetes, OpenShift, cloud-native environments, observability tools, and scripting languages like Python. This role demands a deep understanding of security frameworks and the ability to influence technology roadmaps while balancing current system support with future-state design initiatives.

What you'll do

  • Oversee implementation and automation of operational processes, infrastructure, monitoring, incident response and runbooks.
  • Own end-to-end service reliability, including SLI/SLOs, capacity planning, performance optimization and operational health.
  • Ensure platforms meet IBM CISO and enterprise security standards, regulatory requirements and risk policies.
  • Lead, develop, and mentor a team of Site Reliability Engineers; provide coaching, career development, and performance management.
  • Align team objectives with the strategic direction of the IBM CISO organization and broader Enterprise & Technology Services.

What we're looking for

  • Proven experience managing SRE teams and delivering reliable services.
  • Deep understanding of security compliance and risk management frameworks.
  • Demonstrated success in automating infrastructure and operational tasks.
  • Experience leading the development and mentoring of Site Reliability Engineers.
  • Strong background in Kubernetes or similar container orchestration platforms.
  • Familiarity with observability tools, networking fundamentals, and IaC practices.
  • Excellent communication skills for influencing across teams and leadership.

More like this

Similar roles

Sr Mgr, Site Reliability Engineer (SRE)

The Walt Disney Company

Remote (Orlando, FL) 4 days ago $175,000$215,000
AWS GCP Azure Kubernetes Terraform Ansible Harness GitLab CI/CD Prometheus Grafana Python PostgreSQL Docker AI/ML
Remote

Site Reliability Engineer

Autodesk

Atlanta, GA 16 days ago $117,000$209,330
AWS Kubernetes Terraform Python Linux Bash Docker CI/CD Jenkins Git CloudWatch Splunk Dynatrace New Relic Grafana PostgreSQL MySQL MSSQL EC2 ECS EKS Lambda ELB S3 IAM VPC DynamoDB RDS

Site Reliability Engineer

Booz Allen Hamilton

Herndon, VA 64 days ago $99,000$225,000
Java Spring Boot CI/CD Agile Bitbucket GitLab Kubernetes ArgoCD MongoDB Elasticsearch NiFi Kafka Docker Terraform AWS Grafana Prometheus Python Go

Site Reliability Engineer 5 - Live SRE

Netflix

Remote (Usa - Remote, US) 54 days ago $388,000$558,000
API Gateway IPC Load Testing Observability Monitoring Scalability L4 Load Balancer HTTP Cache Reverse Proxy Unix/Linux TCP/IP DNS TLS HTTP Go Python Rust Kafka Time Series Database Presto Trino Spark SQL
Remote