Director, Software Engineering

Walmart

Actively hiring
Sunnyvale, CA Posted 44 days ago $169,000$338,000 / year

At a glance

AI generated

TL;DR

Walmart Global Tech's Site Reliability Engineering team seeks a Director to lead the transformation of traditional SRE practices into AI-powered, self-healing systems. This role involves designing high-availability platforms and building tools for reliability, scalability, and performance optimization across Walmart’s e-commerce, stores, and omni-channel platform. You will collaborate with leadership to establish strategic plans, define service level objectives, and ensure systems meet strict availability requirements. The ideal candidate has expert-level AI/ML engineering experience, including deep learning frameworks like TensorFlow and PyTorch, and proficiency in cloud-native services, observability tools, and platform engineering practices such as Kubernetes and Terraform. Experience with large-scale retail or e-commerce systems is preferred, along with a strong background in technical leadership and cross-functional collaboration across diverse technology stacks.

Skills

AWS Azure GCP Kubernetes Docker Terraform TensorFlow PyTorch Prometheus Grafana Jaeger OpenTelemetry Istio Linkerd CI/CD AI/ML LLM-based agents Service mesh API gateway Microservices MLOps

What you'll do

  • Design and build tools to enhance reliability, latency, availability, and scalability of Walmart's tech stack.
  • Drive the creation and scaling of fault-tolerant systems in hybrid cloud infrastructure.
  • Establish strategic plans with leadership to improve mean time to detect and restore issues.
  • Define SLOs and SLIs to ensure systems meet service level agreements.
  • Develop AI/ML engineering solutions for intelligent capacity management and predictive performance optimization.

What we're looking for

  • Expert-level AI/ML engineering with deep knowledge of machine learning algorithms and production ML system deployment.
  • Advanced experience with agentic AI systems including multi-agent frameworks and autonomous decision-making platforms.
  • Comprehensive Site Reliability Engineering expertise, including service management and performance engineering for AI/ML systems.
  • Expert-level cloud engineering skills with Azure, GCP, AWS, and deep knowledge of cloud-native AI/ML services.
  • Deep observability and monitoring expertise using distributed tracing, metrics collection, log aggregation, APM tools, and predictive monitoring.
  • Industry experience in large-scale retail or e-commerce systems with strict availability requirements.
  • Technical leadership and cross-functional collaboration skills across diverse engineering teams.

Market check

Salary context

This $169,000–$338,000 range sits above 92% of similar postings on FindRole.

Peer median band

$143,000$237,125

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$165,000$214,500

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Walmart

Walmart Inc. is the world''s largest retailer by revenue, operating a chain of hypermarkets, discount department stores, and grocery stores, as well as a growing e-commerce presence through Walmart.com. Industry: General Merchandise & Grocery Retail

Walmart currently has 495 open roles on FindRole.

Listed pay typically runs $117,000–$234,000 across 487 roles with salary data.

Most-posted roles

View all roles at Walmart

More like this

Similar roles

Director, Software Engineering

JLL (Jones Lang LaSalle)

Remote (Usa-Corp New York Ny-New York, Madison, US) 14 days ago $300,000$375,000
Python Node.js Java Go C# AWS Azure PostgreSQL MongoDB DynamoDB CI/CD Terraform Docker Kubernetes RESTful APIs gRPC React Angular Vue.js
Remote

Director, Software Engineering

Walmart

(Usa) J Street Office Space Ar Bentonville Home Office, US 45 days ago $130,000$260,000
DevOps CI/CD Kubernetes AWS Terraform Python Java PostgreSQL Docker Prometheus Grafana AI ML Agile Scrum

Director, Software Engineering

Walmart

(Usa) J Street Office Space Ar Bentonville Home Office, US 45 days ago $130,000$260,000
DevOps CI/CD AWS Kubernetes Terraform Python Java Go Docker Prometheus Grafana PostgreSQL MongoDB Git Jenkins Ansible Cloud Financial Management AI/ML Agentic Systems

Director, Software Engineering

Walmart

(Usa) Crossman Excellence Building Ca Sunnyvale Home Office, US 53 days ago $169,000$338,000
DevOps AWS Kubernetes Terraform Python Java JavaScript PostgreSQL MongoDB CI/CD Docker Prometheus Grafana AdTech AI/ML

Director, Software Engineering

Capital One Financial

New York, Ny, US 16 days ago $269,100$307,200
Python Java Vue.js Postgres DynamoDB Milvus AWS Agile CI/CD Microservices Kubernetes Docker Terraform

Director of Software Engineering

Fiserv

Berkeley Heights, New Jersey, US 24 days ago $146,000$244,800
AWS Kubernetes CI/CD Java C Distributed systems Microservices architectures HPE NonStop Blockchain AI Machine learning Jenkins GitLab SonarQube