Senior Site Reliability Engineer

CoStar Group

Hybrid · Actively hiring
Arlington, VA · Posted 11 days ago

At a glance

AI generated

TL;DR

As a Senior Site Reliability Engineer, you will join our dynamic team responsible for managing petabytes of real-time data and millions of active users across a globally distributed platform. Your primary responsibilities include leading the design of observability, automation, and incident response frameworks while ensuring reliability for massive-scale systems used worldwide. You will collaborate closely with engineering, product, and data teams to evolve core platforms and drive initiatives around performance optimization, failover strategies, disaster recovery, and cost efficiency. The role requires expertise in Kubernetes, Docker, AWS, CloudFormation, Terraform, and experience with relational or document databases, as well as REST API development. You will thrive on solving complex reliability challenges at scale, bringing a product mindset to enhance customer experience and business impact.

Skills

AWS, Kubernetes, Docker, Terraform, CloudFormation, Python, Java, C#, NodeJS, Bash, PCI compliance, REST API, Microservices, CDN, PostgreSQL, MySQL, Azure, Google Cloud, CI/CD

What you'll do

  • Lead the design of observability, automation, and incident response frameworks.
  • Own reliability for massive-scale systems used globally.
  • Drive initiatives around performance, failover, disaster recovery, and cost optimization.
  • Influence platform direction and infrastructure strategy across cloud and hybrid deployments.
  • Mentor others and lead by example in high-performance environments.

What we're looking for

  • 5+ years of experience managing large distributed systems with massive-scale architecture.
  • Expertise in designing and implementing APM and observability solutions for large-scale data infrastructure.
  • Production experience with Kubernetes, Docker, and container deployment strategies.
  • Infrastructure as code using CloudFormation, Terraform, or similar platforms; AWS expertise required.
  • Experience with relational databases, document databases, and configuring CDNs for performance.
  • Background in Bash scripting and proficiency in programming languages such as Java, Python, or NodeJS.
  • Solid skills in sizing and estimation techniques for small- to medium-sized tasks and projects.

Market check

Salary context

This listing doesn't show a salary. Similar roles on FindRole typically pay $119,800–$198,300.

Peer median band

$119,800–$198,300

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$137,000–$198,859

Middle half of comparable postings.

Based on 239 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About CoStar Group

CoStar Group is the leading provider of commercial real estate information, analytics, and online marketplaces, including CoStar, Apartments.com, and LoopNet platforms.

Industry: Commercial Real Estate Data & Analytics

CoStar Group currently has 31 open roles on FindRole.

Listed pay typically runs $170,000–$222,000 across 11 roles with salary data.

More like this

Similar roles

Senior Site Reliability Engineer

Oracle

US · 14 days ago · $79,100–$158,200
Oracle Cloud Infrastructure Kubernetes Python Go Bash CI/CD Terraform Prometheus Grafana Linux Networking Docker SRE Incident Response SLIs/SLOs Resilience Engineering FedRAMP 3PAO

Senior Site Reliability Engineer

Adobe

San Jose, US · 51 days ago · $208,300–$301,600
AWS Kubernetes Terraform Python Go CI/CD Infrastructure as Code Docker PostgreSQL Security hardening AI-enabled platforms Cross-team leadership Developer experience optimization

Senior Site Reliability Engineer

Carta

Seattle, Washington, US · 55 days ago · $181,688–$213,750
AWS Terraform Python Kubernetes Docker Postgres Prometheus Grafana CI/CD gRPC Ansible ELK Stack Datadog GraphQL

Site Reliability Engineer

Booz Allen Hamilton

Herndon, Virginia, US · 32 days ago · $86,800–$198,000
Java Spring Boot CI/CD Agile Bitbucket GitLab Kubernetes NiFi Kafka MongoDB Elasticsearch ArgoCD

Site Reliability Engineer

Booz Allen Hamilton

Belcamp, Maryland, US · 11 days ago · $86,900–$198,000
VMware SAN storage v6.x VMware High-capacity storage solutions Data center design Virtualized architecture CI/CD User management tools Monitoring tools Python PostgreSQL AWS Kubernetes Docker Prometheus Grafana

Site Reliability Engineer

Booz Allen Hamilton

US · 9 days ago · $86,900–$198,000
VMware SAN storage v6.x VMware High-capacity storage solutions Data center design Virtualized architecture User management tools Monitoring tools CI/CD PostgreSQL AWS Kubernetes Docker Prometheus Grafana