Machine Learning Operations Engineer II

S&P Global

Quick summary

Work type
On-site
Location
Cambridge, MA · New York, NY
Posted
51 days ago
Closes
Apr 3, 2027

Market check

Salary context

How this pay compares to similar roles

Similar $216k
$161k most similar roles pay here $271k

This listing doesn't post a salary. Most similar roles pay $181,587–$249,750.

Based on 239 similar postings.

Employer

About S&P Global

S&P Global delivers Essential Intelligence® that shapes decision making. We provide the world’s leading organizations with the right data, connected technologies and expertise they need to move ahead.

S&P Global currently has 25 open roles on FindRole.

Listed pay typically runs $142,000–$207,700 across 15 roles with salary data.

Most-posted roles

View all roles at S&P Global

At a glance

TL;DR · Machine Learning Operations Engineer II

Join Kensho’s MLOps team as a Machine Learning Operations Engineer II, where you will collaborate with ML engineers and infrastructure teams to develop robust, auditable, and user-friendly tools for the entire ML workflow. Your day-to-day involves iterating on processes, identifying pain points, and implementing scalable solutions that enable rapid experimentation and productionization of models. You’ll also champion open-source and third-party tools, enhance observability for LLMs and agentic applications, and stay abreast of emerging technologies to drive innovation within the team. Ideal candidates have 2+ years of experience in ML infra or similar roles, proficiency with Python and Kubernetes, and familiarity with distributed computing frameworks like Ray and Airflow. The role requires a deep understanding of cloud platforms (AWS), strong debugging skills across layers, and excellent communication abilities to foster adoption of best practices across multiple teams.

What you'll do

  • Develop and refine tools, services, and frameworks to enhance the robustness of ML workflows.
  • Collaborate with ML engineers to identify and resolve pain points in their processes.
  • Provide stable tooling for rapid experimentation and prototyping by ML researchers.
  • Train ML teams on best practices for efficient productionization of models and products.
  • Evaluate and integrate open-source solutions into Kensho’s platform ecosystem.
  • Implement scalable processes for model fine-tuning, reinforcement learning, and LLM evaluation.
  • Enhance observability of agentic applications to monitor performance and detect issues.

What we're looking for

  • 2+ years of experience in ML infrastructure, operations, or engineering.
  • Proficiency in Python and understanding of Kubernetes for managing distributed systems.
  • Experience with cloud platforms like AWS, including EKS and managed ML services.
  • Familiarity with distributed computing frameworks (e.g., Ray) and workflow orchestration tools (e.g., Airflow).
  • Ability to debug complex issues across infrastructure, networking, and application layers.
  • Strong communication skills for driving adoption of new tools and best practices.
  • Curiosity and eagerness to learn in a multi-disciplinary engineering environment.

More like this

Similar roles

Machine Learning Engineer

Apple Inc

Cupertino, CA 43 days ago $212,000$318,400
Python TensorFlow PyTorch scikit-learn Spark Snowflake MLflow CI/CD LLM APIs LangChain LlamaIndex AutoGen MLOps SQL

Machine Learning Engineer II

GEICO

Remote (Bethesda, MD) 9 days ago $105,000$215,000
Python TensorFlow PyTorch Scikit-learn AWS Azure GCP CI/CD Kubernetes Docker SQL Spark Version Control ETL Kafka
Remote Hybrid

Machine Learning Engineer III

Electronic Arts

Kirkland, WA 2 days ago $122,300$158,500
Python SQL AWS GCP Rust Docker Kubernetes CI/CD Terraform Spark
Hybrid

Applied Machine Learning Engineers

Shopify

US 36 days ago
Python TensorFlow PyTorch LLMs Reinforcement Learning Model Quantization Multimodal Models Docker Kubernetes CI/CD PostgreSQL AWS Petabyte-Scale Data Embeddings

Machine Learning Engineer

Adobe

San Jose 72 days ago $183,300$265,350
Python PyTorch LangChain LangGraph MCP ADK LLMs VLLMs CI/CD Docker AWS PostgreSQL Kubernetes