Senior Reliability Engineer (Remote)

Kohl's

Remote

Quick summary

Work type: Remote
Location: Remote
Posted: 2 days ago

Market check

Salary context

How this pay compares to similar roles

Similar roles: about $168k (chart scale: $130k to $212k)

This listing doesn't post a salary. Most similar roles pay $139,375–$196,750.

Based on 240 similar postings.

Employer

About Kohl's

Kohl’s is a leading American omnichannel retailer that operates more than 1,100 department stores in 49 states and runs an e-commerce site, Kohls.com.

Kohl's currently has 13 open roles on FindRole.


At a glance

TL;DR · Senior Reliability Engineer (Remote)

As a Senior Reliability Engineer at Kohl’s, you will join the Site Reliability Engineering team to enhance system resilience and availability. Your daily tasks include driving error budget adoption, conducting root cause analysis during incidents, and implementing robust monitoring practices. You will automate repetitive tasks, optimize operational processes, and mentor engineers on reliability best practices. Key responsibilities involve proactive identification of potential failures using chaos engineering techniques and advising on capacity planning. The role requires strong programming skills in languages like Java, Python, or Go, along with expertise in cloud platforms such as AWS or GCP, monitoring tools like Prometheus, and container orchestration systems like Kubernetes. This position addresses the critical need for system reliability at scale within Kohl’s expansive e-commerce infrastructure.

What you'll do

  • Drive adoption of error budgets and Service Level Objectives across Kohl’s products.
  • Conduct root cause analysis during incidents to implement preventative measures.
  • Implement robust monitoring and failover mechanisms for system reliability.
  • Identify opportunities for automation to reduce operational toil and risks.
  • Perform on-call duties, conduct blameless retrospectives, and drive continuous improvements.
  • Proactively identify potential failures using chaos engineering techniques.

What we're looking for

  • 4+ years of software development experience.
  • Strong programming skills in Java, Python, Go, or Node.js.
  • In-depth knowledge of systems architecture and network fundamentals.
  • Experience with multi-region application troubleshooting and performance tuning.
  • Working experience with cloud platforms (GCP, AWS, Azure) and monitoring tools.
  • Proficiency in incident response, root cause analysis, and chaos engineering.

More like this

Similar roles

Manager, Reliability Engineering (Remote)

Kohl's

Remote (Kohl's Corporate Offices (0900), US) · 2 days ago
AWS, Kubernetes, Docker, Python, Java, Go, Node.js, CI/CD, MLOps, Prometheus, Grafana, CloudWatch, OpenTelemetry, Ansible, Chef, Rancher
Remote

Senior Reliability Engineer

Anduril Industries

Atlanta, GA · 2 days ago · $144,000–$191,000
MIL-HDBK-217, MIL-HDBK-472, MIL-STD-810, MIL-STD-461, MIL-STD-516C, MIL-STD-1629, DO-254, DO-178, FMEA, Fault Tree Analysis, Weibull analysis, Predictive reliability analysis, Qualification Test plans, Reliability Engineering framework, Highly Accelerated Life Testing, Environmental Testing, MIL-HDBK-217Plus, HALT, HASS

Senior Reliability Engineer

JLL (Jones Lang LaSalle)

NJ · 53 days ago · $140,000–$160,000
Excel, CMMS, EAM, ISO9001, ISO55001, RCM, CbM, PdM, BAS, PLC, SCADA, SQL, Python, R, Tableau, Building Automation Systems, Energy Management Platforms, Vibration Analysis, Oil Analysis, Infrared Thermography, Ultrasound, Motor Current Analysis, CMMS/EAM systems, Microsoft Excel

Senior Reliability Engineer

Medtronic

Mounds View South, MN · 3 days ago · $107,200–$160,800
ISO 14971, ISO 13485, FMEA, CAPA, DFMEA, PSUR, Risk Management Report, Microsoft Word, Microsoft Excel, Microsoft PowerPoint, Design Control, Root Cause Analysis, Statistical Data Analysis, CI/CD
Hybrid