Site Reliability Engineer (Senior or Staff), Storage Layer Services (SLS)

MongoDB

Remote

Quick summary

Work type
Remote
Location
Remote
Salary
$126,000–$248,000 / yr
Posted
13 days ago

Market check

Salary context

Competitive pay

How this pay compares to similar roles

Similar $180k
This role $187k
$111k most similar roles pay here $263k

This role pays more than 53% of similar roles. Most pay $144,350–$214,900 — the shaded band above. At the midpoint, this role pays about $187k versus about $180k for comparable roles.

Based on 240 similar postings.

Employer

About MongoDB

MongoDB is a leading American software company that develops and provides commercial support for a popular, source-available document database. Designed to handle unstructured and structured data natively, its platform is purpose-built for modern cloud applications, analytics, and AI experiences.

MongoDB currently has 287 open roles on FindRole.

Listed pay typically runs $126,500–$209,000 across 104 roles with salary data.

Most-posted roles

View all roles at MongoDB

At a glance

TL;DR · Site Reliability Engineer (Senior or Staff), Storage Layer Services (SLS)

As a Senior Site Reliability Engineer (SRE) joining MongoDB’s small, senior team in Boston, New York City, Raleigh, Miami, Pittsburgh, or remotely within the Eastern/Central time zones, you will play a pivotal role in defining Service Level Objectives and shaping capacity plans for Atlas storage services. Your day-to-day responsibilities include building reliable, durable, and operationally safe systems, identifying critical metrics to ensure service health, and participating in 24/7 on-call rotations. You’ll work with distributed storage systems, optimize infrastructure performance from the application level down to the kernel, and leverage Python or Go for automation and efficiency. This role requires experience with Kubernetes, cloud platforms like AWS, GCP, or Azure, Linux internals, and networking concepts, as you tackle the multi-year roadmap of MongoDB’s cloud storage architecture at scale.

What you'll do

  • Define and implement Service Level Objectives (SLOs) for storage services.
  • Shape capacity plans to ensure reliability and durability of the storage layer.
  • Build self-healing infrastructure to maintain service availability and resilience.
  • Configure key metrics to detect incidents and measure service performance.
  • Participate in 24/7 on-call rotations to resolve critical issues promptly.

What we're looking for

  • At least 6 years of experience in software development and operating distributed systems.
  • Proficiency in Python, Go, or a similar programming language.
  • Experience with stateful storage or database systems at scale.
  • Strong customer focus and efficiency in processes and operations.
  • Expertise in cloud infrastructure platforms like AWS, GCP, or Azure.
  • Understanding of Linux OS internals and networking concepts.
  • Preference for automation over manual processes.

More like this

Similar roles