Lead Site Reliability Engineer

Alteryx

Remote

Quick summary

Work type: Remote
Location: Remote
Salary: $136,000–$177,000 / yr
Posted: 59 days ago

Market check

Salary context

Below market

How this pay compares to similar roles

Similar $177k

This role $156k

$126k most similar roles pay here $230k

This role pays less than 68% of similar roles. Most pay $142,450–$210,912 — the shaded band above. At the midpoint, this role pays about $156k versus about $177k for comparable roles.

Based on 240 similar postings.

Employer

About Alteryx

Alteryx is a leading AI-ready data and analytics company that powers actionable insights to help organizations drive smarter, faster decisions with data.

Alteryx currently has 7 open roles on FindRole.

Listed pay typically runs $136,000–$177,000 across 7 roles with salary data.

Most-posted roles

View all roles at Alteryx

At a glance

TL;DR · Lead Site Reliability Engineer

Apply Now Log in to save

As a Lead SRE at Alteryx, Inc., you will lead technical initiatives to ensure the reliability of our modern split-plane, multi-region SaaS platform for enterprise customers. Your daily responsibilities include defining and driving reliability strategies, establishing and operationalizing service level objectives (SLOs), and leading architecture reviews to enhance system scalability and cost efficiency. You’ll also champion automation efforts, mentor senior engineers, and collaborate across teams to align engineering with product priorities. The ideal candidate has over six years of experience in delivering complex distributed systems or SaaS platforms, proficiency in languages like Python, Java, C++, or JavaScript, and deep expertise in Kubernetes, CI/CD pipelines, GitOps, observability tools, and disaster recovery practices. This role involves working on a large-scale platform that serves enterprise customers globally, emphasizing the importance of multi-region resilience and failover design.

Skills

Kubernetes CI/CD GitOps ArgoCD SLO SLA observability Infrastructure as Code chaos engineering Datadog Grafana Python Java C++ JavaScript AWS Azure Google Cloud Platform PostgreSQL MySQL Redis Docker Terraform

What you'll do

Define and drive reliability strategy for multi-region SaaS platform systems.
Establish and operationalize SLOs, SLAs, and error budgets to inform planning.
Lead initiatives to improve MTTR, incident prevention, and overall service health.
Own end-to-end incident management, driving systemic fixes and long-term improvements.
Mentor senior engineers and act as a technical leader across teams.
Champion automation and modernization efforts for reliability improvements.

What we're looking for

6+ years leading complex distributed systems or SaaS platforms
Strong experience with multi-region split-plane architectures (control-plane/data-plane)
Proven track record in improving system reliability, MTTR, and SLOs at scale
Proficiency in Python, Java, C++, or JavaScript
Deep expertise in Kubernetes, CI/CD, GitOps, observability, and incident management
Strong leadership skills with experience mentoring senior engineers and influencing cross-team decisions

Similar roles

Site Reliability Engineer

Booz Allen Hamilton

Herndon, VA 40 days ago $86,800–$198,000

Java Spring Boot CI/CD Agile Bitbucket GitLab Kubernetes NiFi Kafka MongoDB Elasticsearch ArgoCD

Save

Site Reliability Engineer

Booz Allen Hamilton

Panama City, FL 1 day ago $86,800–$198,000

VMware SAN storage v6.x VMware Physical servers Storage systems Network infrastructures User management tools Monitoring tools CI/CD PostgreSQL AWS Kubernetes Docker Prometheus Grafana

Save