Senior Site Reliability Engineer

Oracle

Quick summary

Work type: On-site
Location: Austin, TX
Salary: $83,000–$187,000 / yr
Posted: 10 days ago

Market check

Salary context

Below market

How this pay compares to similar roles

[Chart: pay scale from $68k to $226k. This role: $135k. Similar roles: $171k. A shaded band marks where most similar roles pay.]

This role pays less than 82% of similar roles. Most similar roles pay $142,450–$198,850, the shaded band above. At the midpoint, this role pays about $135k versus about $171k for comparable roles.

Based on 239 similar postings.

Employer

About Oracle

Oracle Corporation is a leading multinational technology company specializing in database software, cloud computing, and enterprise software.

Oracle currently has 755 open roles on FindRole.

Listed pay typically runs $97,500–$209,500 across 568 roles with salary data.

At a glance

TL;DR · Senior Site Reliability Engineer

As a Senior Site Reliability Engineer on Oracle’s OCI Incident Response team, you will join a globally distributed group dedicated to minimizing customer-impacting events through efficient incident management and automation. Day to day, you will lead the triage and resolution of major incidents, collaborate with subject matter experts to restore services quickly, and document insights for process improvement. You will also design high-scale, secure, and resilient systems, partnering with development teams to define operational requirements and guide engineering efforts. The role requires expertise in cloud computing design patterns, incident management methodologies, and automation tools such as Chef, Ansible, Jenkins, and Terraform. With a focus on Oracle Cloud Infrastructure (OCI), you will contribute to the continuous evolution of OCI’s state-of-the-art systems, ensuring high availability and performance for global customers.

What you'll do

  • Solve complex cloud infrastructure problems and automate tasks to ensure minimal human intervention.
  • Coordinate with subject matter experts during major incidents to restore services quickly, and document progress accurately.
  • Utilize deep knowledge of cloud computing design patterns to mitigate complex incidents effectively.
  • Apply a systematic approach to troubleshooting large, interconnected systems during incident detection and orchestration.
  • Document incident details to improve processes, identify deviations, and build an incident knowledge base.
  • Monitor high-level service dashboards, identifying and addressing anomalies proactively.
  • Identify opportunities for automation and continuous improvement of the incident management process.

What we're looking for

  • 3+ years of experience in Site Reliability Engineering, DevOps, or System Engineering
  • Extensive public cloud operations experience (AWS, Azure, GCP, OCI)
  • Proven expertise in Major Incident Management within a cloud environment
  • Proficiency in at least one modern object-oriented programming language
  • Strong familiarity with infrastructure automation tools such as Chef, Ansible, Jenkins, and Terraform
  • Excellent knowledge of IaaS, CI/CD systems, Docker, RESTful APIs, and log analysis tools

More like this

Similar roles

Senior Site Reliability Engineer

Oracle

Nashville, TN +1 · 33 days ago · $79,100–$158,200
AWS, Azure, GCP, OCI, Major Incident Management, Agile, Terraform, Docker, CI/CD, RESTful APIs, Jenkins, Chef, Ansible, Prometheus, Grafana, Python, Go

Senior Site Reliability Engineer

Adobe

San Jose · 69 days ago · $208,300–$301,600
AWS, Kubernetes, Terraform, Python, Go, CI/CD, Infrastructure as Code, Docker, PostgreSQL, Security hardening, AI-enabled platforms, Cross-team leadership, Developer experience optimization

Senior Site Reliability Engineer

Carta

San Francisco, California +2 · Hybrid · 73 days ago · $181,688–$213,750
AWS, Terraform, Python, Kubernetes, Docker, Postgres, Prometheus, Grafana, CI/CD, gRPC, Ansible, ELK Stack, Datadog, GraphQL

Senior Site Reliability Engineer

Oracle

Reston, VA +2 · 38 days ago
Oracle Linux, Ansible, Terraform, Python, Bash, Prometheus, Grafana, GlusterFS, Active Directory, LDAP, Kerberos, CI/CD, PostgreSQL, Docker, Kubernetes, Git, Jenkins

Senior Site Reliability Engineer

The Federal Reserve

Boston, MA · 20 days ago · $140,000–$210,900
AWS, Terraform, Python, Docker, EKS, RDS, Aurora, S3, Route53, ELB, IAM, CloudWatch, OpenSearch, Grafana, Prometheus, CI/CD, Kubernetes, Ansible, Linux, Shell scripting, EC2, EBS, Observability

Senior Site Reliability Engineer

Anduril Industries

Costa Mesa, CA · 12 days ago · $166,000–$220,000
Linux, Python, Terraform, Kubernetes, Docker, Ansible, Networking, Security, CI/CD, Monitoring, Splunk, AWS, Azure, GCP