Staff, Data Scientist

Walmart

Actively hiring Posted this week
Bentonville, AR · Sunnyvale, CA Posted 4 days ago $110,000$220,000 / year

At a glance

AI generated

TL;DR

Join Walmart’s Merchandising Decision Sciences team as the founding Staff Data Scientist for External Data and Analytics Products. You will design, build, deploy, scale, and monitor ML systems that provide insights into the Rest of Market (ROM), shaping category decisions by modeling external retail data. Responsibilities include developing embedding-based classification models, GMV distribution normalization projections, and causal impact models to quantify market-share movement from merchandising actions. Utilizing Databricks on GCP, you will establish MLOps foundations for production-scale deployment, ensuring robust monitoring and retraining mechanisms. Required skills encompass extensive experience in scaling ML systems, proficiency with PySpark, Delta, MLflow, Unity Catalog, and GCP services like BigQuery and Vertex AI, along with expertise in vector embeddings, supervised classification, forecasting, causal inference, and Python/SQL programming. Ideal candidates have a track record of founding or solo data scientist roles, establishing MLOps from scratch, and handling production-scale models rigorously.

Skills

Databricks GCP PySpark Delta MLflow Unity Catalog BigQuery Vertex AI Python SQL XGBoost LightGBM CausalImpact CI/CD MLOps Forecasting Causal Inference Vector Embeddings

What you'll do

  • Design and build embedding-based classification models to align external product signals with Walmart's merchandising hierarchy.
  • Develop GMV distribution normalization and projection models for reconciling internal and external sales data across categories, time, and geography.
  • Construct causal impact models to quantify market-share movement from merchandising actions using advanced statistical methods.
  • Engineer ML systems on Databricks (on GCP) for production deployment, including PySpark, Delta, and MLflow integration.
  • Establish MLOps foundations for continuous monitoring and scaling of the ROM platform by new team members.
  • Own end-to-end lifecycle of models from problem framing to incident response in a production environment.

What we're looking for

  • Extensive hands-on experience shipping ML systems from notebook to production at scale and maintaining them through monitoring and retraining.
  • Deep expertise in developing and scaling ML on Databricks using PySpark, Delta, MLflow, Unity Catalog, and Model Serving.
  • Strong proficiency with GCP services including BigQuery, Vertex AI, Cloud Run, and Composer/Airflow for seamless integration with Databricks.
  • Proven skills in vector embeddings, supervised classification at scale, forecasting, distribution modeling, and causal inference for retail data.
  • Experience as a founding or solo data scientist on a program, establishing MLOps foundations from scratch and ensuring engineering-level rigor.

Employer

About Walmart

Walmart Inc. is the world''s largest retailer by revenue, operating a chain of hypermarkets, discount department stores, and grocery stores, as well as a growing e-commerce presence through Walmart.com. Industry: General Merchandise & Grocery Retail

Walmart currently has 486 open roles on FindRole.

Listed pay typically runs $117,000–$234,000 across 480 roles with salary data.

Most-posted roles

View all roles at Walmart