Principal Site Reliability Engineer

The Walt Disney Company

Remote Actively hiring

Remote, USA · Disney's Hollywood Studios - Feature Animation Building, FL Posted 51 days ago

View original post Log in to save

At a glance

AI generated

TL;DR

As a Principal Site Reliability Engineer at Disney Experiences (DX) in the US Parks & Resorts and Experiences organization, you will lead and mentor a team focused on enhancing observability and reducing toil through advanced SRE practices. Your daily tasks include advocating for service level management, designing green field products, and integrating AI/LLM-assisted reliability engineering. You will also drive development pipelines, automate infrastructure, and ensure high system reliability while adhering to security standards. The role requires expertise in DevOps tools like CI/CD, containerization, and observability platforms, as well as experience with multiple cloud providers such as AWS, Azure, and GCP. Additionally, you must have a strong background in architecting scalable infrastructure using Terraform, Ansible, and Chef, and applying AI to optimize system reliability. This position is integral to maintaining the world-class digital experiences for Disney’s premier vacation brands.

Skills

AWS Azure GCP Terraform CloudFormation Ansible Chef CI/CD Docker Kubernetes Prometheus Grafana Python Linux Windows AI LLM PCI DevOps SRE SLI SLO SLA

What you'll do

Lead the SRE culture and mentor team members to enhance observability and reduce operational toil.
Advocate for service level management by advancing SLIs, SLOs, and SLAs adoption across systems.
Design and support green field products while evaluating build vs. buy decisions for new technologies.
Drive AI/LLM-assisted reliability engineering by creating secure workflows and architecting AI-enabled capabilities.
Automate infrastructure and operations to create telemetry for monitoring and ensure high system reliability.
Establish and manage systems administration requirements on Linux and Windows platforms for operational excellence.
Engage in estimation and planning, providing technical recommendations to improve development pipelines.

What we're looking for

Minimum 10 years of related work experience in Site Reliability Engineering.
Expertise in defining and implementing observability strategies for complex distributed systems.
Comprehensive hands-on experience with DevOps toolsets including CI/CD, containerization, and monitoring tools.
Demonstrated ability to engineer cloud-agnostic solutions across AWS, Azure, and GCP.
Mastery in architecting highly available, scalable infrastructure using configuration management tools.
Extensive experience with high-demand releases and PCI audit standards.

Market check

Salary context

This listing doesn't show a salary. Similar roles on FindRole typically pay $128,635–$212,800.

Peer median band

$128,635–$212,800

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$137,229–$216,485

Middle half of comparable postings.

Based on 238 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About The Walt Disney Company

The Walt Disney Company is a diversified global entertainment and media enterprise operating in segments including Disney Parks, Experiences and Products; Entertainment (ABC, Hulu, Disney+); and ESPN. Industry: Entertainment & Media

The Walt Disney Company currently has 107 open roles on FindRole.

Listed pay typically runs $143,650–$192,650 across 100 roles with salary data.

Most-posted roles

View all roles at The Walt Disney Company

Similar roles

Principal Site Reliability Engineer

The Walt Disney Company

Remote (Usa - Fl - Disney'S Hollywood Studios - Feature Animation Building, US) 44 days ago

Akamai Kona Site Defender WAF Bot Manager DevOps CI/CD Python Go Docker Terraform AWS Azure Google Cloud PostgreSQL MongoDB Redis Prometheus Grafana Kubernetes Ansible Jenkins GitLab GitHub

Remote

Sr Principal Site Reliability Engineer

The Walt Disney Company

Remote (Usa - Ca - Market St, US) 54 days ago $250,500–$335,900

Kubernetes AWS CI/CD Docker Prometheus Grafana Python PostgreSQL Terraform Ansible GitOps CDN integration media streaming technologies content delivery strategies

Remote

Site Reliability Engineer

The Walt Disney Company

Remote (Usa - Fl - Disney'S Hollywood Studios - Feature Animation Building, US) 52 days ago

Akamai Splunk AppDynamics GitHub Ansible Chef AWS Azure GCP CI/CD RESTful APIs Microservices Cloud computing Python JavaScript Kubernetes Terraform Prometheus Grafana

Remote

Site Reliability Engineer

Equifax

Usa - Missouri - St. Louis - Lackland, US 46 days ago

AWS GCP Terraform Jenkins Python Bash Docker Kubernetes CI/CD Prometheus PostgreSQL Linux Windows Ansible Chef

Site Reliability Engineer

Shopify

US 30 days ago

Kubernetes Docker CI/CD Python Go PostgreSQL AWS GCP Prometheus Grafana Terraform GitOps

Site Reliability Engineer

Booz Allen Hamilton

US 33 days ago $62,000–$141,000

AWS Linux Docker CI/CD