Lead Site Reliability Engineer

Alloy

Hybrid

Quick summary

Work type
Hybrid
Location
NY
Salary
$179,000–$226,000 / yr
Posted
83 days ago

Market check

Salary context

Above market

How this pay compares to similar roles

Similar $181k
This role $202k
$133k most similar roles pay here $236k

This role pays more than 71% of similar roles. Most pay $152,150–$209,750 — the shaded band above. At the midpoint, this role pays about $202k versus about $181k for comparable roles.

Based on 240 similar postings.

Employer

About Alloy

Alloy is an identity decisioning platform that provides fraud prevention, compliance, and credit underwriting solutions for banks, credit unions, and fintechs to automate identity verification decisions. Industry: Financial Technology & Identity Verification

Alloy currently has 8 open roles on FindRole.

Listed pay typically runs $153,500–$192,000 across 8 roles with salary data.

Most-posted roles

View all roles at Alloy

At a glance

TL;DR · Lead Site Reliability Engineer

Join Alloy’s Infrastructure Team as an experienced engineer with 10+ years in infrastructure or SRE roles, where you will design and build automated systems for managing large-scale Kubernetes clusters and databases. Your day-to-day involves reducing operational toil by automating manual processes, building internal tooling for safe self-service changes, and enhancing the reliability of complex distributed systems. You’ll work closely with other engineers to contribute to architecture decisions, write production-quality code, and participate in on-call rotations focused on preventing incidents. Proficiency in software engineering, Infrastructure as Code tools like Terraform, observability tools such as Datadog, and a programming language like Python or Go is essential. Experience with AWS and Kubernetes at scale is highly desirable for this role that emphasizes automation, system scalability, and reliability.

What you'll do

  • Design and build systems to automate infrastructure management at scale.
  • Reduce operational toil by automating manual processes into reliable workflows.
  • Build internal tooling for safe self-service changes for other engineers.
  • Improve reliability and resilience of Kubernetes, databases, and services.
  • Implement and evolve deployment systems for applications in Kubernetes.

What we're looking for

  • 10+ years experience in infrastructure, SRE, or software engineering roles.
  • Strong software engineering skills with proficiency in at least one programming language.
  • Experience managing production infrastructure at scale using cloud and containerized systems.
  • Expertise in Infrastructure as Code (e.g., Terraform) and running/troubleshooting distributed systems.
  • Proficiency with observability tools like Datadog, CloudWatch, ELK/EFK for monitoring and debugging.
  • Participation in on-call rotations and a focus on preventing incidents through system improvements.

More like this

Similar roles

Lead Site Reliability Engineer

Alteryx

Remote 81 days ago $136,000$177,000
Kubernetes CI/CD GitOps ArgoCD SLO SLA observability Infrastructure as Code chaos engineering Datadog Grafana Python Java C++ JavaScript AWS Azure Google Cloud Platform PostgreSQL MySQL Redis Docker Terraform
Remote

Lead Site Reliability Engineer

JPMorgan Chase

New York, NY 9 days ago $152,000$215,000
CI/CD Kubernetes Docker Terraform JavaScript Go Python GraphQL Kafka OpenTelemetry AI Jenkins GitLab ECS

Lead Site Reliability Engineer

JPMorgan Chase

Plano, TX 2 days ago
AWS Azure Python Grafana Dynatrace Prometheus Cloudwatch Splunk CI/CD AI ML Site Reliability Engineering Observability Monitoring Telemetry Collection

Site Reliability Engineer

Equifax

St. Louis, Missouri +1 74 days ago
AWS GCP Terraform Jenkins Python Bash Docker Kubernetes CI/CD Prometheus PostgreSQL Linux Windows Ansible Chef
Hybrid

Site Reliability Engineer

Shopify

Europe 58 days ago
Kubernetes Docker CI/CD Python Go PostgreSQL AWS GCP Prometheus Grafana Terraform GitOps