Reliability Engineering - Observability

Wise

Actively hiring
US Posted 74 days ago $65,000$85,000 / year

At a glance

AI generated

TL;DR

As a Software Engineer on Wise’s Observability team, you will own and evolve the centralized observability configuration for hundreds of product services, ensuring standardized and reliable telemetry across the company. You’ll work with open-source tools like Grafana and Elasticsearch to enhance observability tooling and collaborate closely with engineering teams to improve their monitoring experience. Your daily tasks include managing a high-scale environment where automation and efficiency are key, implementing robust configurations for metrics, logs, traces, alerts, and profiling, and supporting mission-critical services. Ideal candidates have software engineering experience in Java, Go, or Python, familiarity with observability tools like the ELK stack and OpenTelemetry, and a strong understanding of microservices architecture.

Skills

Python Java Go OpenTelemetry Grafana Elasticsearch Kubernetes CI/CD Docker Prometheus

What you'll do

  • Own and maintain centralised observability configuration for hundreds of product services.
  • Evolve observability tooling using open-source projects like Grafana and Elasticsearch.
  • Implement automation to manage high-scale observability estate efficiently.
  • Collaborate with engineering teams to enhance their observability experience.
  • Ensure robust, secure, cost-efficient, and developer-friendly observability tools.

What we're looking for

  • Experienced in software engineering with Java, Go, or Python.
  • Proficient in observability tools like Grafana and ELK stack.
  • Understanding of OpenTelemetry for application instrumentation.
  • Knowledge of microservices architecture and distributed systems.
  • Eagerness to support high-scale, mission-critical infrastructure.
  • Growth mindset with passion for building developer-friendly tools.

Market check

Salary context

This $65,000–$85,000 range sits above 1% of similar postings on FindRole.

Peer median band

$128,350$202,350

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$136,866$201,089

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Wise

Wise (formerly TransferWise) is a global technology company specializing in international money transfers and multi-currency accounts, offering transparent low-cost foreign exchange for individuals and businesses. Industry: Financial Technology & International Payments

Wise currently has 49 open roles on FindRole.

Most-posted roles

View all roles at Wise

More like this

Similar roles

Reliability Engineer*

3M

Remote (Us, Minnesota, Cottage Grove, US) 38 days ago $124,127$151,710
Reliability Centered Maintenance Total Productive Maintenance Predictive Maintenance Preventive Maintenance Key Performance Indicators Criticality Analysis Mechanical Drawings Analytical Tools Electrical Engineering Control Engineering Electro/Mechanical Engineering
Remote

Observability Lead - Cloud SRE & Network Reliability

Lam Research

Fremont, Ca,Us, US 41 days ago $114,000$253,000
Azure AWS GCP Prometheus Grafana Datadog PagerDuty Terraform Python Kubernetes CI/CD Ansible Go IaC SLA/SLO/SLI DR/BCP ExpressRoute DirectConnect CloudInterconnect MPLS LLM-based agents RAG patterns

Reliability Engineer II

Medtronic

Remote (Usa-Mn Mounds View South, US) 10 days ago $84,800$127,200
ISO13485 ISO14971 21CFR820 Design Controls V&V Risk Management System-thinking Process Mapping CI/CD Statistical Process Control DOE Mathematical Models Environmental Testing QMS
Remote

Engineer - Reliability Engineering & Enablement

Target

7000 Target Pkwy N,Ncd-0375 Brooklyn Park,Mn 55445, US 30 days ago $75,400$135,700
Golang JavaScript Node.js REST APIs ServiceNow Docker Kubernetes AWS Azure GitHub CI/CD PostgreSQL MongoDB Redis Prometheus Grafana GitLab Jenkins Terraform

Observability Developer

Adobe

Lehi, US 50 days ago $148,500$214,950
Go Python Docker Kubernetes AWS Azure Splunk Clickhouse Loki Elastic Grafana Cortex Tempo OpenTelemetry CI/CD Prometheus SLOs SLIs DevOps SRE