Site Reliability Engineering (SRE) Manager

IBM

Quick summary

Work type: On-site
Location: Austin, TX
Posted: 37 days ago

Market check

Salary context

How this pay compares to similar roles

Salary chart: axis from $130k to $230k, with similar roles marked at $179k.

This listing doesn't post a salary. Most similar roles pay $142,450–$215,462.

Based on 238 similar postings.

Employer

About IBM

IBM is a US-based global technology company providing hybrid cloud, AI, consulting, enterprise software, and IT infrastructure products and services.

IBM currently has 743 open roles on FindRole.

Listed pay typically runs $1,000,000–$1,000,000 across 8 roles with salary data.

At a glance

TL;DR · Site Reliability Engineering (SRE) Manager

The Site Reliability Engineering (SRE) Manager on IBM’s CISO Platform team leads a high-performing SRE group that keeps internal security platforms performant, resilient, and compliant. The role oversees operational processes, infrastructure automation, monitoring, and incident response, and ensures adherence to enterprise security standards and regulatory requirements. Day-to-day responsibilities include driving team execution, collaborating with cross-functional teams, and fostering a culture of accountability and innovation. The ideal candidate has experience managing SRE or DevOps teams, delivering reliable services, and automating operational tasks. Key skills include Kubernetes, OpenShift, cloud-native environments, observability tools, and scripting languages such as Python, along with professional certifications such as AWS certifications or CISSP.

What you'll do

  • Oversee implementation and automation of operational processes, infrastructure, monitoring, incident response and runbooks.
  • Own end-to-end service reliability, including SLIs and SLOs, capacity planning, performance optimization and operational health.
  • Ensure platforms meet IBM CISO and enterprise security standards, regulatory requirements and risk policies.
  • Lead, develop, and mentor a team of Site Reliability Engineers; provide coaching, career development, and performance management.
  • Align team objectives with the strategic direction of the IBM CISO organization and broader Enterprise & Technology Services.

What we're looking for

  • Proven experience managing SRE teams and delivering reliable services.
  • Deep understanding of security compliance and risk management frameworks.
  • Demonstrated success in automating infrastructure and operational tasks.
  • Experience leading the development and mentoring of a high-performing team.
  • Strong background in cloud-native environments and container orchestration.
  • Excellent communication skills for influencing across multiple teams.
  • Professional Cloud or Security certifications preferred.

More like this

Similar roles

Site Reliability Engineering (SRE) Manager

IBM

Research Triangle Park, NC · 37 days ago
Kubernetes CI/CD Python Terraform AWS Azure GCP IBM Cloud OpenShift Jira Scrum Ansible Prometheus Grafana Git Docker Linux Networking Security Compliance Risk Management

Sr Mgr, Site Reliability Engineer (SRE)

The Walt Disney Company

Remote (Orlando, FL) · 4 days ago · $175,000–$215,000
AWS GCP Azure Kubernetes Terraform Ansible Harness GitLab CI/CD Prometheus Grafana Python PostgreSQL Docker AI/ML

Site Reliability Engineer

Autodesk

Atlanta, GA · 16 days ago · $117,000–$209,330
AWS Kubernetes Terraform Python Linux Bash Docker CI/CD Jenkins Git CloudWatch Splunk Dynatrace New Relic Grafana PostgreSQL MySQL MSSQL EC2 ECS EKS Lambda ELB S3 IAM VPC DynamoDB RDS

Site Reliability Engineer

Booz Allen Hamilton

Herndon, VA · 64 days ago · $99,000–$225,000
Java Spring Boot CI/CD Agile Bitbucket GitLab Kubernetes ArgoCD MongoDB Elasticsearch NiFi Kafka Docker Terraform AWS Grafana Prometheus Python Go

Site Reliability Engineer 5 - Live SRE

Netflix

Remote (USA) · 54 days ago · $388,000–$558,000
API Gateway IPC Load Testing Observability Monitoring Scalability L4 Load Balancer HTTP Cache Reverse Proxy Unix/Linux TCP/IP DNS TLS HTTP Go Python Rust Kafka Time Series Database Presto Trino Spark SQL