Senior Site Reliability Engineer

Oracle

Actively hiring
Nashville, TN · Austin, TX Posted 16 days ago $79,100$158,200 / year

At a glance

AI generated

TL;DR

As a Senior Site Reliability Engineer at Oracle’s Cloud Infrastructure (OCI) team, you will join a globally distributed squad dedicated to maintaining high service availability by swiftly addressing and mitigating incidents that impact OCI services. Your daily tasks will include automating routine operations, coordinating with subject matter experts during major incidents, and documenting insights for continuous improvement. You will leverage your deep understanding of cloud computing design patterns and automation principles to enhance system resilience and scalability. Key skills required are extensive experience in public cloud operations, major incident management, and proficiency in modern programming languages like Python or Java, alongside familiarity with tools such as Chef, Ansible, Jenkins, Terraform, Docker, and RESTful APIs. This role offers exposure to cutting-edge technologies and the opportunity to influence broad organizational initiatives aimed at improving OCI’s service availability on a large scale.

Skills

AWS Azure GCP OCI Major Incident Management Agile Terraform Docker CI/CD RESTful APIs Jenkins Chef Ansible Prometheus Grafana Python Go

What you'll do

  • Solve complex cloud infrastructure issues and automate tasks for continuous service availability.
  • Coordinate with subject matter experts to manage major incidents efficiently and document progress.
  • Use deep knowledge of cloud computing patterns to mitigate complex incidents effectively.
  • Develop and maintain technical architecture documentation for large-scale distributed systems.
  • Monitor high-level dashboards, identify anomalies, and take corrective actions promptly.
  • Identify opportunities for automation in incident management processes and drive continuous improvement.
  • Partner with development teams to define operational requirements for product roadmaps.

What we're looking for

  • 3+ years of experience in Site Reliability Engineering, DevOps, or System Engineering
  • Extensive public cloud operations experience (AWS, Azure, GCP, OCI)
  • Proven expertise in Major Incident Management within a cloud environment
  • Strong understanding and application of automation and orchestration principles
  • Proficiency in at least one modern object-oriented programming language
  • Experience with infrastructure automation tools like Chef, Ansible, Jenkins, Terraform
  • Expertise in Infrastructure-as-a-Service, CI/CD systems, Docker, RESTful APIs, log analysis

Market check

Salary context

This $79,100–$158,200 range sits above 13% of similar postings on FindRole.

Peer median band

$119,800$199,000

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$137,000$200,150

Middle half of comparable postings.

Based on 239 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Oracle

Oracle Corporation is a leading multinational technology company specializing in database software, cloud computing, and enterprise software.

Oracle currently has 251 open roles on FindRole.

Listed pay typically runs $97,500–$199,500 across 193 roles with salary data.

Most-posted roles

View all roles at Oracle

More like this

Similar roles

Senior Site Reliability Engineer

Oracle

US 14 days ago $79,100$158,200
Oracle Cloud Infrastructure Kubernetes Python Go Bash CI/CD Terraform Prometheus Grafana Linux Networking Docker SRE Incident Response SLIs/SLOs Resilience Engineering FedRAMP 3PAO

Senior Site Reliability Engineer

Adobe

San Jose, US 51 days ago $208,300$301,600
AWS Kubernetes Terraform Python Go CI/CD Infrastructure as Code Docker PostgreSQL Security hardening AI-enabled platforms Cross-team leadership Developer experience optimization

Senior Site Reliability Engineer

CoStar Group

US 11 days ago
AWS Kubernetes Docker Terraform CloudFormation Python Java C# NodeJS Bash PCI compliance REST API Microservices CDN PostgreSQL MySQL Azure Google Cloud CI/CD

Senior Site Reliability Engineer

The Federal Reserve

Boston, Ma, US 45 days ago $140,000$210,900
AWS Terraform Python Go Docker CI/CD Kubernetes EKS RDS Aurora S3 Route53 ELB IAM Consul Vault Ansible Linux Shell Scripting CloudWatch OpenSearch Grafana Prometheus

Senior Site Reliability Engineer

Carta

Seattle, Washington, US 55 days ago $181,688$213,750
AWS Terraform Python Kubernetes Docker Postgres Prometheus Grafana CI/CD gRPC Ansible ELK Stack Datadog GraphQL

Senior Site Reliability Engineer

Oracle

Reston, Virginia, US 21 days ago
Oracle Linux Ansible Terraform Python Bash Prometheus Grafana Kubernetes CI/CD Git Active Directory LDAP Kerberos GlusterFS PostgreSQL Docker AWS Azure Google Cloud Platform Nginx Apache HTTP Server