Staff Site Reliability Engineer, Fabric

MongoDB

Remote

Quick summary

Work type
Remote
Location
CanadaNew York, NY
Salary
$144,000–$200,000 / yr
Posted
13 days ago

Market check

Salary context

Competitive pay

How this pay compares to similar roles

Similar $178k
This role $172k
$132k most similar roles pay here $230k

This role pays less than 61% of similar roles. Most pay $145,675–$209,750 — the shaded band above. At the midpoint, this role pays about $172k versus about $178k for comparable roles.

Based on 239 similar postings.

Employer

About MongoDB

MongoDB is a leading American software company that develops and provides commercial support for a popular, source-available document database. Designed to handle unstructured and structured data natively, its platform is purpose-built for modern cloud applications, analytics, and AI experiences.

MongoDB currently has 287 open roles on FindRole.

Listed pay typically runs $126,500–$209,000 across 104 roles with salary data.

Most-posted roles

View all roles at MongoDB

At a glance

TL;DR · Staff Site Reliability Engineer, Fabric

As a Senior Site Reliability Engineer (SRE) on the Fabric team within Platform Engineering at MongoDB, you will play a crucial role in building and maintaining robust infrastructure for secure communication between services. Your day-to-day responsibilities include developing and implementing network architecture, service mesh, and edge load balancing solutions to ensure high availability and reliability of MongoDB’s multi-cloud environment. You will leverage your deep expertise in networking fundamentals, distributed systems, and automation to collaborate with internal teams on best practices and technical guidance for service connectivity. The ideal candidate has over a decade of experience in software development and operating distributed systems, familiarity with modern cloud infrastructure (AWS, Azure, or GCP), and a strong preference for automated processes. This role is pivotal in supporting MongoDB’s globally connected network that underpins its critical services.

What you'll do

  • Design and implement secure network architecture for multi-cloud environments.
  • Develop and maintain service mesh and load-balancing solutions in a distributed system.
  • Automate operational processes to ensure efficiency and reliability of infrastructure.
  • Provide technical support and guidance on best practices for internal teams.
  • Participate in 24/7 on-call rotation to resolve critical network issues promptly.

What we're looking for

  • 10+ years of experience in distributed systems and networking fundamentals.
  • Deep expertise in TCP/IP, DNS, TLS/mTLS, BGP, tunnels, overlays, and SDN principles.
  • Familiarity with cloud-based infrastructure primitives like VPCs, subnets, routing, and CDNs.
  • Strong knowledge of service mesh and load-balancing concepts for multi-cloud environments.
  • Customer-focused mindset with a preference for automation in operations processes.

More like this

Similar roles

Staff Site Reliability Engineer, Fabric

MongoDB

Remote (New York, NY) +3 13 days ago $127,000$249,000
AWS Azure GCP Kubernetes Terraform Python DNS TLS mTLS BGP VPCs subnetting routing VPNs peering private_link CDNs service_mesh load_balancing CI/CD IPv6 SDN
Remote

Staff Site Reliability Engineer

TransUnion

Chicago +4 53 days ago $112,500$187,500
GCP Kubernetes CI/CD Prometheus Grafana PostgreSQL MySQL Redis Terraform Python Bash Go VPC DNS Load Balancing Firewall Rules VPN Private Service Connect LLM Orchestration Vector Databases Model Serving Infrastructure AI Observability
Hybrid

Staff Site Reliability Engineer

CME Group

Chicago, IL 44 days ago $132,100$220,100
GCP Kubernetes Python Terraform ArgoCD Go Node.js CI/CD Distributed Systems Generative AI Agile PostgreSQL GitOps CICD SLI SLO Error Budgets
Hybrid

Staff Site Reliability Engineer

CME Group

Chicago, IL 36 days ago $132,100$220,100
Google Cloud Services Terraform GKE CloudFormation Chef Java Python Bash Go Typescript Rust CI/CD OpenTelemetry Prometheus SRE Kubernetes Docker GitOps Security frameworks Observability
Hybrid

Site Reliability Engineer

Equifax

St. Louis, Missouri +1 62 days ago
AWS GCP Terraform Jenkins Python Bash Docker Kubernetes CI/CD Prometheus PostgreSQL Linux Windows Ansible Chef
Hybrid

Site Reliability Engineer

Shopify

Europe 46 days ago
Kubernetes Docker CI/CD Python Go PostgreSQL AWS GCP Prometheus Grafana Terraform GitOps