Principal Site Reliability Engineer

The Walt Disney Company

Remote Actively hiring
Remote, USA · Disney's Hollywood Studios - Feature Animation Building, FL Posted 51 days ago

At a glance

AI generated

TL;DR

As a Principal Site Reliability Engineer at Disney Experiences (DX) in the US Parks & Resorts and Experiences organization, you will lead and mentor a team focused on enhancing observability and reducing toil through advanced SRE practices. Your daily tasks include advocating for service level management, designing green field products, and integrating AI/LLM-assisted reliability engineering. You will also drive development pipelines, automate infrastructure, and ensure high system reliability while adhering to security standards. The role requires expertise in DevOps tools like CI/CD, containerization, and observability platforms, as well as experience with multiple cloud providers such as AWS, Azure, and GCP. Additionally, you must have a strong background in architecting scalable infrastructure using Terraform, Ansible, and Chef, and applying AI to optimize system reliability. This position is integral to maintaining the world-class digital experiences for Disney’s premier vacation brands.

Skills

AWS Azure GCP Terraform CloudFormation Ansible Chef CI/CD Docker Kubernetes Prometheus Grafana Python Linux Windows AI LLM PCI DevOps SRE SLI SLO SLA

What you'll do

  • Lead the SRE culture and mentor team members to enhance observability and reduce operational toil.
  • Advocate for service level management by advancing SLIs, SLOs, and SLAs adoption across systems.
  • Design and support green field products while evaluating build vs. buy decisions for new technologies.
  • Drive AI/LLM-assisted reliability engineering by creating secure workflows and architecting AI-enabled capabilities.
  • Automate infrastructure and operations to create telemetry for monitoring and ensure high system reliability.
  • Establish and manage systems administration requirements on Linux and Windows platforms for operational excellence.
  • Engage in estimation and planning, providing technical recommendations to improve development pipelines.

What we're looking for

  • Minimum 10 years of related work experience in Site Reliability Engineering.
  • Expertise in defining and implementing observability strategies for complex distributed systems.
  • Comprehensive hands-on experience with DevOps toolsets including CI/CD, containerization, and monitoring tools.
  • Demonstrated ability to engineer cloud-agnostic solutions across AWS, Azure, and GCP.
  • Mastery in architecting highly available, scalable infrastructure using configuration management tools.
  • Extensive experience with high-demand releases and PCI audit standards.

Market check

Salary context

This listing doesn't show a salary. Similar roles on FindRole typically pay $128,635–$212,800.

Peer median band

$128,635$212,800

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$137,229$216,485

Middle half of comparable postings.

Based on 238 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About The Walt Disney Company

The Walt Disney Company is a diversified global entertainment and media enterprise operating in segments including Disney Parks, Experiences and Products; Entertainment (ABC, Hulu, Disney+); and ESPN. Industry: Entertainment & Media

The Walt Disney Company currently has 107 open roles on FindRole.

Listed pay typically runs $143,650–$192,650 across 100 roles with salary data.

Most-posted roles

View all roles at The Walt Disney Company

More like this

Similar roles

Principal Site Reliability Engineer

The Walt Disney Company

Remote (Usa - Fl - Disney'S Hollywood Studios - Feature Animation Building, US) 44 days ago
Akamai Kona Site Defender WAF Bot Manager DevOps CI/CD Python Go Docker Terraform AWS Azure Google Cloud PostgreSQL MongoDB Redis Prometheus Grafana Kubernetes Ansible Jenkins GitLab GitHub
Remote

Sr Principal Site Reliability Engineer

The Walt Disney Company

Remote (Usa - Ca - Market St, US) 54 days ago $250,500$335,900
Kubernetes AWS CI/CD Docker Prometheus Grafana Python PostgreSQL Terraform Ansible GitOps CDN integration media streaming technologies content delivery strategies
Remote

Site Reliability Engineer

The Walt Disney Company

Remote (Usa - Fl - Disney'S Hollywood Studios - Feature Animation Building, US) 52 days ago
Akamai Splunk AppDynamics GitHub Ansible Chef AWS Azure GCP CI/CD RESTful APIs Microservices Cloud computing Python JavaScript Kubernetes Terraform Prometheus Grafana
Remote

Site Reliability Engineer

Equifax

Usa - Missouri - St. Louis - Lackland, US 46 days ago
AWS GCP Terraform Jenkins Python Bash Docker Kubernetes CI/CD Prometheus PostgreSQL Linux Windows Ansible Chef

Site Reliability Engineer

Shopify

US 30 days ago
Kubernetes Docker CI/CD Python Go PostgreSQL AWS GCP Prometheus Grafana Terraform GitOps