Technical Operations & Site Reliability Engineer, Customer Systems

Apple Inc

Quick summary

Work type
On-site
Location
Sunnyvale, CA
Salary
$147,400–$272,100 / yr
Posted
56 days ago

Market check

Salary context

Above market

How this pay compares to similar roles

Similar $168k
This role $210k
$113k most similar roles pay here $289k

This role pays more than 81% of similar roles. Most pay $137,375–$199,200 — the shaded band above. At the midpoint, this role pays about $210k versus about $168k for comparable roles.

Based on 239 similar postings.

Employer

About Apple Inc

Apple Inc. is a multinational technology company known for designing and manufacturing consumer electronics, software, and online services, including the iPhone, Mac, iPad, and App Store. Industry: Consumer Electronics & Software

Apple Inc currently has 1723 open roles on FindRole.

Listed pay typically runs $162,500–$272,100 across 1398 roles with salary data.

Most-posted roles

View all roles at Apple Inc

At a glance

TL;DR · Technical Operations & Site Reliability Engineer, Customer Systems

As a Technical Operations & Site Reliability Engineer at Apple’s Customer Systems team, you will play a pivotal role in maintaining the reliability and performance of business-critical, globally distributed systems. Your day-to-day responsibilities include managing large-scale production outages, designing automation solutions to streamline monitoring and operational workflows, and developing tools using Java/JEE, REST, Swift/Objective-C, Python, Go, or Bash to enhance system reliability. You will collaborate closely with support, engineering, and business operations teams to improve efficiency and stability while driving operational metrics and KPIs. Ideal candidates possess strong software engineering skills, experience in AI and LLM models for operational tasks, and a deep understanding of networking protocols and distributed systems. This role requires expertise in scripting languages, automation tools, and the ability to work effectively in a fast-paced, 24x7 environment across multiple locations.

What you'll do

  • Manage large-scale production outages by leading incident response and improving efficiency.
  • Design and build automation solutions to streamline monitoring and management of distributed systems.
  • Develop tools using Java/JEE, REST, Python, Go, or Bash to automate operational tasks and improve reliability.
  • Plan and execute system health monitoring and incident response across critical global applications.
  • Create and maintain accurate documentation for architecture, infrastructure configuration, and procedures.
  • Utilize AI and LLM models to enhance operational efficiency in application support.

What we're looking for

  • Experience in interpreting operational data from monitoring tools like Hubble, Splunk, and ExtraHop.
  • B.S. in Computer Science or equivalent work experience in technical operations.
  • Proficiency in scripting languages such as Java, JEE, REST, Swift/Objective-C, Python, Go, Bash.
  • Understanding of standard networking protocols including HTTP, DNS, TCP/IP, ICMP, OSI Model.
  • Experience using AI and LLMs to enhance operational efficiency through model training and optimization.

More like this

Similar roles

Site Reliability Engineer, Customer Systems, IS&T

Apple Inc

Sunnyvale, CA 23 days ago $147,400$220,900
Kubernetes Helm Shell Scripting Python Ansible Splunk Grafana Prometheus Alertmanager CI/CD DNS TCP HTTP HTTPS ArgoCD GitOps Metric Monitoring SLO MTTR GenAI Workflow Automation

Site Reliability Engineer, Customer Systems, IS&T

Apple Inc

Sunnyvale, CA 6 days ago $147,400$220,900
Kubernetes Helm Shell Scripting Python Ansible Splunk Grafana Prometheus Alertmanager CI/CD DNS TCP HTTP HTTPS ArgoCD GitOps Metric Monitoring SLO MTTR GenAI Workflow Automation

Sr Site Reliability Engineer, Customer Systems

Apple Inc

Austin, TX 23 days ago
Kubernetes Helm Python Shell Scripting Ansible Splunk Grafana Prometheus Alertmanager CI/CD DNS TCP HTTP AWS S3 Cassandra MongoDB Couchbase Java ArgoCD GitOps MTTR SLO GenAI

Operations Reliability Engineer

Apple Inc

Cupertino, CA 56 days ago $147,400$272,100
Reliability Testing Failure Analysis Mechanical Stress Tests Environmental Testing Fatigue Testing Optical Microscopy X-Ray SEM/EDS FTIR XPS DOE FMEA Statistical Process Control Machine Learning Big Data Design of Experiments Accelerated Test Models Reliability Models JEDEC ASTM IEEE