Senior Site Reliability Engineer, Fleet Management

MongoDB

Remote

Quick summary

Work type
Remote
Location
Dublin, Ireland
Salary
$127,000–$249,000 / yr
Posted
13 days ago

Market check

Salary context

Competitive pay

How this pay compares to similar roles

Similar $178k
This role $188k
$112k most similar roles pay here $264k

This role pays more than 57% of similar roles. Most pay $146,250–$209,750 — the shaded band above. At the midpoint, this role pays about $188k versus about $178k for comparable roles.

Based on 240 similar postings.

Employer

About MongoDB

MongoDB is a leading American software company that develops and provides commercial support for a popular, source-available document database. Designed to handle unstructured and structured data natively, its platform is purpose-built for modern cloud applications, analytics, and AI experiences.

MongoDB currently has 287 open roles on FindRole.

Listed pay typically runs $126,500–$209,000 across 104 roles with salary data.

Most-posted roles

View all roles at MongoDB

At a glance

TL;DR · Senior Site Reliability Engineer, Fleet Management

As a Senior Platform Engineer on the Fleet Management team within SRE at MongoDB, you will contribute to developing and maintaining a scalable and secure Kubernetes-based runtime environment that supports product needs across the organization. Your day-to-day responsibilities include providing internal support for our Kubernetes ecosystem, participating in 24/7 on-call rotations, and driving systemic fixes through blameless post-mortems. You should have deep experience with containerization technologies like Kubernetes, proficiency in Go or Python, and a solid understanding of Linux operating system internals and networking concepts. Additionally, expertise in cloud infrastructure platforms such as AWS, GCP, or Azure, along with tools like Terraform, Crossplane, and ACK for provisioning infrastructure, is essential. This role involves designing secure multi-tenant runtime environments and automating processes to eliminate manual operations, aligning with the team’s focus on building software solutions to reduce operational overhead in a rapidly scaling environment.

What you'll do

  • Develop and maintain a scalable, secure Kubernetes runtime environment for product needs.
  • Provide internal support for the Kubernetes ecosystem, assisting engineering teams with domain-specific issues.
  • Participate in 24/7 on-call rotation to resolve critical production issues promptly.
  • Design and implement multi-tenant runtime environments from first principles.
  • Automate infrastructure provisioning using tools like Terraform, Crossplane, and ACK.
  • Debug complex production issues and drive them to resolution for continuous improvement.

What we're looking for

  • 6+ years of experience in software development and operating distributed systems.
  • Proficiency in Go or Python with a commitment to code quality and testing practices.
  • Deep experience using and extending Kubernetes for containerization technologies.
  • Solid understanding of Linux OS internals and networking concepts (TCP/IP, DNS).
  • Strong operational ownership and track record of debugging complex production issues.

More like this

Similar roles

Senior Site Reliability Engineer

Adobe

San Jose 69 days ago $208,300$301,600
AWS Kubernetes Terraform Python Go CI/CD Infrastructure as Code Docker PostgreSQL Security hardening AI-enabled platforms Cross-team leadership Developer experience optimization

Senior Site Reliability Engineer

Carta

San Francisco, California +2 73 days ago $181,688$213,750
AWS Terraform Python Kubernetes Docker Postgres Prometheus Grafana CI/CD gRPC Ansible ELK Stack Datadog GraphQL
Hybrid

Senior Site Reliability Engineer

Oracle

Reston, VA +2 38 days ago
Oracle Linux Ansible Terraform Python Bash Prometheus Grafana GlusterFS Active Directory LDAP Kerberos CI/CD PostgreSQL Docker Kubernetes Git Jenkins

Senior Site Reliability Engineer

Oracle

Nashville, TN +1 33 days ago $79,100$158,200
AWS Azure GCP OCI Major Incident Management Agile Terraform Docker CI/CD RESTful APIs Jenkins Chef Ansible Prometheus Grafana Python Go