Lead Site Reliability Engineer

Alteryx

Remote

Quick summary

Work type: Remote
Location: Remote
Salary: $136,000–$177,000 / yr
Posted: 59 days ago

Market check

Salary context

Below market

How this pay compares to similar roles

[Chart: midpoint pay on a scale from $126k to $230k. This role: $156k. Similar roles: $177k. A shaded band marks where most similar roles pay.]

This role pays less than 68% of similar roles. Most similar roles pay $142,450–$210,912, the shaded band in the chart above. At the midpoint, this role pays about $156k, versus about $177k for comparable roles.

Based on 240 similar postings.

Employer

About Alteryx

Alteryx is a leading AI-ready data and analytics company that powers actionable insights to help organizations drive smarter, faster decisions with data.

Alteryx currently has 7 open roles on FindRole.

Listed pay typically runs $136,000–$177,000 across 7 roles with salary data.

At a glance

TL;DR · Lead Site Reliability Engineer

As a Lead SRE at Alteryx, Inc., you will lead technical initiatives to ensure the reliability of our modern split-plane, multi-region SaaS platform for enterprise customers. Your daily responsibilities include defining and driving reliability strategies, establishing and operationalizing service level objectives (SLOs), and leading architecture reviews to enhance system scalability and cost efficiency. You’ll also champion automation efforts, mentor senior engineers, and collaborate across teams to align engineering with product priorities. The ideal candidate has over six years of experience in delivering complex distributed systems or SaaS platforms, proficiency in languages like Python, Java, C++, or JavaScript, and deep expertise in Kubernetes, CI/CD pipelines, GitOps, observability tools, and disaster recovery practices. This role involves working on a large-scale platform that serves enterprise customers globally, emphasizing the importance of multi-region resilience and failover design.

What you'll do

  • Define and drive reliability strategy for multi-region SaaS platform systems.
  • Establish and operationalize SLOs, SLAs, and error budgets to inform planning.
  • Lead initiatives to improve MTTR, incident prevention, and overall service health.
  • Own end-to-end incident management, driving systemic fixes and long-term improvements.
  • Mentor senior engineers and act as a technical leader across teams.
  • Champion automation and modernization efforts for reliability improvements.

What we're looking for

  • 6+ years leading complex distributed systems or SaaS platforms
  • Strong experience with multi-region split-plane architectures (control-plane/data-plane)
  • Proven track record in improving system reliability, MTTR, and SLOs at scale
  • Proficiency in Python, Java, C++, or JavaScript
  • Deep expertise in Kubernetes, CI/CD, GitOps, observability, and incident management
  • Strong leadership skills with experience mentoring senior engineers and influencing cross-team decisions

More like this

Similar roles

Site Reliability Engineer

Booz Allen Hamilton

Herndon, VA · 40 days ago · $86,800–$198,000
Java Spring Boot CI/CD Agile Bitbucket GitLab Kubernetes NiFi Kafka MongoDB Elasticsearch ArgoCD

Site Reliability Engineer

Booz Allen Hamilton

Panama City, FL · 1 day ago · $86,800–$198,000
VMware SAN storage v6.x VMware Physical servers Storage systems Network infrastructures User management tools Monitoring tools CI/CD PostgreSQL AWS Kubernetes Docker Prometheus Grafana

Site Reliability Engineer

Booz Allen Hamilton

Herndon, VA · 64 days ago · $99,000–$225,000
Java Spring Boot CI/CD Agile Bitbucket GitLab Kubernetes ArgoCD MongoDB Elasticsearch NiFi Kafka Docker Terraform AWS Grafana Prometheus Python Go

Site Reliability Engineer

Equifax

St. Louis, Missouri · 52 days ago
AWS GCP Terraform Jenkins Python Bash Docker Kubernetes CI/CD Prometheus PostgreSQL Linux Windows Ansible Chef
Hybrid

Site Reliability Engineer

Shopify

Europe · 36 days ago
Kubernetes Docker CI/CD Python Go PostgreSQL AWS GCP Prometheus Grafana Terraform GitOps