Site Reliability Engineer (Edge Services), Infrastructure Services

Apple Inc

Quick summary

Work type: On-site
Location: Austin, TX
Posted: 18 days ago

Market check

Salary context

How this pay compares to similar roles

[Chart: similar roles marked at $182k on a scale from $128k to $233k]

This listing doesn't post a salary. Most similar roles pay $142,412–$222,000.

Based on 238 similar postings.

Employer

About Apple Inc

Apple Inc. is a multinational technology company known for designing and manufacturing consumer electronics, software, and online services, including the iPhone, Mac, iPad, and App Store. Industry: Consumer Electronics & Software

Apple Inc currently has 638 open roles on FindRole.

Listed pay typically runs $171,600–$272,100 across 505 roles with salary data.

At a glance

TL;DR · Site Reliability Engineer (Edge Services), Infrastructure Services

Join our Infrastructure Services team as a Site Reliability Engineer (SRE) focused on Edge Services. You will drive the evolution of our production ecosystems by designing and implementing advanced observability and alerting strategies. Day-to-day work includes automating repetitive operations, optimizing traffic flow using deep networking expertise, and collaborating with development teams to integrate reliability into CI/CD pipelines. You will use Python or Go for automation, manage monitoring tools such as Prometheus and Grafana, and apply data structures and algorithms to improve system performance. Ideal candidates have experience with cloud environments such as AWS, GCP, or Azure, with Kubernetes orchestration, and with leading blameless post-mortems to improve system resilience. Familiarity with Generative AI tools for observability and debugging is highly valued, with the aim of shifting from reactive to proactive engineering practices.

What you'll do

  • Design and implement advanced observability and alerting strategies for high-cardinality data.
  • Build self-healing systems through aggressive automation to reduce operational toil.
  • Partner with development teams to integrate reliability into CI/CD pipelines.
  • Optimize traffic flow by debugging protocol-level issues in HTTP/2, HTTP/3, and HTTPS/TLS.
  • Manage modern monitoring suites like Prometheus, Grafana, and ClickHouse to produce high-quality alerts.
  • Consult on service design to enhance long-term maintainability and resilience.

What we're looking for

  • Deep understanding of Linux internals and expertise in HTTP/2, HTTP/3 (QUIC), and HTTPS/TLS.
  • Proven ability to automate complex workflows using Python or Go.
  • Experience configuring modern monitoring tools like Prometheus, Grafana, and ClickHouse.
  • Knowledge of SLIs, SLOs, error budgets, release management, and incident management.
  • Practical application of data structures and algorithms to optimize system performance.
  • Hands-on experience with Kubernetes for scaling and securing containerized workloads.

More like this

Similar roles

Sr Site Reliability Engineer, Customer Systems

Apple Inc

Austin, TX · 17 days ago
Kubernetes, Helm, Python, Shell Scripting, Ansible, Splunk, Grafana, Prometheus, Alertmanager, CI/CD, DNS, TCP, HTTP, AWS, S3, Cassandra, MongoDB, Couchbase, Java, ArgoCD, GitOps, MTTR, SLO, GenAI

Site Reliability Engineer III

CME Group

Chicago, IL · 123 days ago · $100,700–$167,800
GCP, Docker, Kubernetes, Python, Java, Oracle, Postgres, BigQuery, SLO, SLI, SLA, OpenTelemetry, Splunk, Prometheus, Grafana, CI/CD, Bamboo, JIRA, Git

Site Reliability Engineer, Discovery

Anduril Industries

Seattle, WA · 2 days ago · $166,000–$220,000
AWS, Azure, GCP, Kubernetes, CI/CD, Rust, Go, Python, C++, PostgreSQL, Docker, Terraform, Prometheus, GitLab, Ansible