Principal Infrastructure & Reliability Engineer

Oracle

Quick summary

Work type
On-site
Location
Posted
50 days ago

Market check

Salary context

How this pay compares to similar roles

Similar $181k
$128k most similar roles pay here $231k

This listing doesn't post a salary. Most similar roles pay $142,400–$219,250.

Based on 240 similar postings.

Employer

About Oracle

Oracle Corporation is a leading multinational technology company specializing in database software, cloud computing, and enterprise software.

Oracle currently has 467 open roles on FindRole.

Listed pay typically runs $97,500–$209,500 across 353 roles with salary data.

Most-posted roles

View all roles at Oracle

At a glance

TL;DR · Principal Infrastructure & Reliability Engineer

As a Principal Infrastructure & Reliability Engineer at Oracle's Health Data Intelligence team, you will design, build, and operate highly reliable, scalable infrastructure for large-scale healthcare analytics platforms. Your day-to-day responsibilities include advancing automation, observability, and AI-assisted reliability practices by leveraging technologies such as Generative AI, Kubernetes, Terraform, Prometheus, Grafana, and Python or Java. You will work closely with a collaborative team to handle massive datasets, improve system resilience, and enhance operational efficiency in multi-cloud environments like OCI, AWS, and Azure. This role requires extensive experience in cloud infrastructure, SRE, DevOps, and a strong understanding of distributed systems and data warehousing platforms.

What you'll do

  • Design, build, and operate reliable, scalable infrastructure for large-scale analytics workloads.
  • Improve system reliability through automation, monitoring, and performance optimization.
  • Contribute to the adoption of AI-assisted approaches for operations, including observability and incident response.
  • Partner with development teams to enhance service architecture and operability.
  • Perform root cause analysis and implement long-term fixes for complex production issues.
  • Drive continuous improvement in DevOps/SRE practices, including CI/CD and automation at scale.

What we're looking for

  • Experience building and operating high-availability, fault-tolerant systems.
  • Strong understanding of distributed systems, performance monitoring, and resiliency patterns.
  • Hands-on experience with AI-driven automation in infrastructure lifecycle management.
  • Deep expertise in multi-cloud environments (OCI, AWS/Azure) and cloud infrastructure design.
  • Advanced competency in CI/CD pipelines, Infrastructure as Code, and observability tools.
  • Proficiency in data warehousing platforms and large-scale ETL frameworks.
  • Strong problem-solving skills with a focus on automation-first operations.

More like this

Similar roles

Principal Reliability Engineer

Medtronic

Billerica, MA 47 days ago $149,500$187,200
Root_Cause_Analysis FDA_21_CFR_Part_820 ISO_14971 ISO_13485 ISO_9001 ISO_10012 ISO_17025 Lean_Six_Sigma GMP GDP CAPA Risk_Analysis FMEA Verification_and_Validation Design_of_Experiments Statistical_Analysis Installation_Qualification Operational_Qualification Performance_Qualification Test_Method_Validations Capital_Equipment_Design Single_Use_Device_Design
Hybrid

Principal Reliability Engineer

Medtronic

Remote (Usa-Mn Plymouth Berkshire, US) 5 days ago $132,000$198,000
Python SQL DOE SPC Risk Management Supplier Quality Change Management Reliability Engineering Verification Validation Testing Oversight Design Controls Statistical Analysis
Remote Hybrid

Principal, Infrastructure Engineering

The OCC

Dallas, TX 37 days ago $209,394$223,700
AWS Kubernetes Jenkins GitHub Actions Terraform Python CI/CD Azure VMware Cisco Java MSSQL CloudFormation Microservices Serverless Multicloud Hybrid Cloud Compliance Regulatory Strategy
Hybrid

Lead Infrastructure Engineer

Salesforce

Remote (Indianapolis, Indiana) 18 days ago $172,500$260,100
Google Workspace Microsoft O365 Zoom Teams Confluence DNS SaaS PaaS IaaS virtualization Kubernetes AWS Azure GCP CI/CD PostgreSQL MongoDB Git Terraform Ansible Python JavaScript REST APIs OAuth LDAP Active Directory NIST ISO 27001 PCI DSS
Remote