Principal, Software Engineer - Observability

Walmart

Quick summary

Work type: On-site
Location: Sunnyvale, CA
Salary: $143,000–$286,000 / yr
Posted: 9 days ago

Market check

Salary context

Above market

How this pay compares to similar roles

Similar roles (midpoint): $183k
This role (midpoint): $214k
Chart range: $114k to $304k, with a shaded band marking where most similar roles pay

This role pays more than 80% of similar roles. Most pay $154,000–$211,200 — the shaded band above. At the midpoint, this role pays about $214k versus about $183k for comparable roles.

Based on 239 similar postings.

Employer

About Walmart

Walmart Inc. is the world's largest retailer by revenue, operating a chain of hypermarkets, discount department stores, and grocery stores, as well as a growing e-commerce presence through Walmart.com.

Industry: General Merchandise & Grocery Retail

Walmart currently has 189 open roles on FindRole.

Listed pay typically runs $110,000–$220,000 across 181 roles with salary data.

At a glance

TL;DR · Principal, Software Engineer - Observability

As a principal engineer for observability, you will lead the architecture and development of cloud-native observability designs and managed services, with a focus on scalability, latency, and fault tolerance in large-scale distributed systems. Day to day, you will design telemetry software systems built on data models, metric libraries, distributed tracing, and real-time streaming pipelines, and collaborate with enterprise architects, product managers, and engineers to bring R&D projects into production. Proficiency in Java is essential, along with experience in API, library, and SDK development, cloud infrastructure, and large-scale distributed systems. You will also use time-series databases (TSDBs) with AI and machine learning techniques for anomaly detection and projections of system behavior. The role requires a deep understanding of cloud technologies, real-time telemetry pipelines, and data warehousing, plus strong communication skills for communicating architectural designs to internal and external stakeholders.

What you'll do

  • Design and develop large-scale distributed systems focusing on scalability, latency, and fault tolerance.
  • Create visionary software architectures for observability products using telemetry technologies like TSDBs and real-time data streaming pipelines.
  • Lead research initiatives for cloud-native designs in public and private clouds, integrating AI for anomaly detection.
  • Collaborate with cross-functional teams to bring telemetry R&D projects into production at an enterprise-wide scale.
  • Use Java to develop applications, libraries, SDKs, and services for observability solutions.

What we're looking for

  • BS/MS in Computer Science or Engineering with over 10 years of software engineering, design, and architecture experience.
  • Proficiency in the Java language and its frameworks, with extensive experience developing Java applications, libraries, SDKs, and services.
  • Strong leadership in enterprise-level software implementation and architectural research, evaluation, creation, and distributed system deployment.
  • Experience in full-stack cloud software development, including API, library, and SDK development, integration, and use.
  • Expertise in large-scale distributed systems, real-time telemetry pipelines, TSDBs, data warehousing, and ETL processes.

More like this

Similar roles

Senior Software Engineer, Observability

MongoDB

Dublin, Ireland · 14 days ago
MongoDB Python Java C# Go Kafka Flink TypeScript React Node.js PostgreSQL CI/CD Docker Git Linux RESTful APIs GraphQL Messaging Queues Monitoring Tools Cloud Platforms
Hybrid

Senior Software Engineer — Observability

Apple Inc

Cary, NC · 40 days ago
OpenTelemetry Grafana Kubernetes Python Java Kotlin Go Prometheus Terraform Docker CI/CD PostgreSQL Redis RabbitMQ Splunk Datadog LLMs AI APIs SRE CI/CD systems

Senior Software Engineer — Observability

Apple Inc

Austin, TX · 40 days ago
OpenTelemetry Grafana Datadog Kubernetes Python Java Kotlin Go Prometheus Terraform CI/CD PostgreSQL NoSQL Redis RabbitMQ LLMs AI APIs SRE Docker AWS Azure