Machine Learning Engineer - HSTU

Shopify

Quick summary

Work type
On-site
Location
Ottawa, ON, Canada
Posted
42 days ago

Market check

Salary context

How this pay compares to similar roles

Similar $223k
$160k most similar roles pay here $278k

This listing doesn't post a salary. Most similar roles pay $195,875–$249,750.

Based on 239 similar postings.

Employer

About Shopify

Shopify is a leading global commerce platform that enables businesses of all sizes to start, grow, and manage their retail operations online and in-person. It provides tools for storefronts, payments, shipping, and marketing to millions of merchants worldwide.

Shopify currently has 25 open roles on FindRole.

Most-posted roles

View all roles at Shopify

At a glance

TL;DR · Machine Learning Engineer - HSTU

As a Machine Learning Engineering Lead or individual contributor at Shopify, you will join an innovative team focused on developing and deploying advanced HSTU models to enhance merchant growth and consumer experience. Your daily responsibilities include building scalable AI/ML system architectures, designing sophisticated inference pipelines that process billions of events in real-time, and implementing data pipelines for model training across diverse sources. You will also experiment with novel architectures, optimize production through techniques like negative sampling and distributed GPU training, and collaborate cross-functionally to deliver measurable business impact. The ideal candidate has expertise in recommendation systems, generative AI, Python, shell scripting, and experience with streaming and batch data pipelines, vector databases, and orchestration tools. This role involves working on a product that serves 100M+ users, requiring proficiency in handling large-scale data and complex technical challenges.

What you'll do

  • Develop and deploy Generative AI and HSTU-based recommendation models at scale.
  • Design scalable AI/ML system architectures supporting complex models.
  • Build inference pipelines processing billions of events for real-time recommendations.
  • Implement data pipelines for model training, fine-tuning, and evaluation across sources.
  • Optimize production processes using advanced techniques like negative sampling and ANN search.

What we're looking for

  • Mastery in recommendation systems and generative AI/LLMs.
  • Extensive experience deploying machine learning products at scale.
  • Expertise in building data pipelines from diverse sources.
  • Proficiency in Python, shell scripting, and orchestration tools.
  • Experience with distributed clusters and GPU optimization for ML.
  • Ability to communicate complex ML concepts effectively to all audiences.

More like this

Similar roles

Machine Learning Engineer

Adobe

San Jose 78 days ago $183,300$265,350
Python PyTorch LangChain LangGraph MCP ADK LLMs VLLMs CI/CD Docker AWS PostgreSQL Kubernetes

Machine Learning Engineer

Adobe

San Jose 88 days ago $161,700$234,150
Python TensorFlow PyTorch scikit-learn SparkML Kubernetes AWS CI/CD SQL Docker PostgreSQL MLOps

Machine Learning Engineer

Adobe

San Jose +1 8 days ago $161,700$234,150
Python AWS GCP Azure MLOps CI/CD Docker Kubernetes Prometheus Terraform PostgreSQL Git Agentic systems Multi-agent orchestration LLM-as-a-judge Retrieval-Augmented Generation RAG NLP pipelines

Machine Learning Engineer

Motorola Solutions

Los Angeles, CA 60 days ago $120,000$160,000
Python TensorFlow PyTorch scikit-learn MATLAB C++ signal processing wireless communication MIMO OFDM SDRs GPU acceleration embedded machine learning real-time systems adaptive modulation beamforming cognitive radio techniques 3GPP IEEE 802.11/15 military waveforms
Hybrid

Machine Learning Engineer

Q2

Austin, TX 50 days ago
Python TensorFlow PyTorch scikit-learn R Java cloud platforms scalable computing resources machine learning pipelines data analysis statistics optimization probability theory experimental methodologies CI/CD
Hybrid

Machine Learning Engineer

PayPal

Chicago, Illinois +1 94 days ago $117,500$174,350
Python TensorFlow scikit-learn Hadoop Spark SQL logistic_regression time_series_analysis random_forests SVMs XGBoost CNNs RNNs CI/CD
Hybrid