Data, Lakehouse and AI Data Platform Engineer, Associate

Goldman Sachs

Quick summary

Work type
On-site
Location
New York, NY
Salary
$115,000–$180,000 / yr
Posted
1 day ago

Market check

Salary context

Below market

How this pay compares to similar roles

Similar $190k
This role $148k
$102k most similar roles pay here $238k

This role pays less than 81% of similar roles. Most pay $159,375–$220,975 — the shaded band above. At the midpoint, this role pays about $148k versus about $190k for comparable roles.

Based on 240 similar postings.

Employer

About Goldman Sachs

Goldman Sachs is a leading global investment banking, securities, and investment management firm providing financial services to corporations, financial institutions, governments, and individuals.

Goldman Sachs currently has 187 open roles on FindRole.

Listed pay typically runs $130,000–$250,000 across 60 roles with salary data.

Most-posted roles

View all roles at Goldman Sachs

At a glance

TL;DR · Data, Lakehouse and AI Data Platform Engineer, Associate

As a Data Engineer in the Lakehouse and AI Data Platform team, you will design, build, test, and support data pipelines and curated datasets on the firm’s modern data platform. Your responsibilities include developing batch and streaming data pipelines using Python or Java, ensuring they are production-ready and well-tested. You’ll also create raw, refined, and curated datasets for analytics and AI use cases while applying sound data modeling principles to ensure accuracy and consistency. Additionally, you will implement data quality controls and contribute to shared tooling that enhances platform reliability. This role requires strong hands-on programming skills in Python or Java, proficiency with SQL, and familiarity with distributed data processing frameworks like Apache Spark. You’ll work within a modern tech stack including Snowflake, Databricks, and Kafka, contributing to the firm’s AI and analytics capabilities by building scalable and reliable data solutions.

What you'll do

  • Design, build, and support data pipelines on the Lakehouse and AI data platform.
  • Develop refined datasets that accurately represent business entities for analytics use.
  • Implement controls to ensure completeness, accuracy, and consistency of production data.
  • Work with consumers to shape usable and well-documented data products aligned to needs.
  • Contribute to shared tooling or framework components to improve how the platform is used.

What we're looking for

  • 0-2+ years of data engineering or relevant experience
  • Strong hands-on programming skills in Python or Java
  • Proficient in SQL for troubleshooting, optimization, and analysis
  • Experience with Apache Spark and distributed data processing frameworks
  • Knowledge of modern data formats like JSON, Avro, Parquet
  • Understanding of temporal data modeling and schema design principles
  • Ability to work collaboratively in a fast-paced production environment

More like this

Similar roles

Software Engineer, Data Platform

DoorDash, Inc

San Francisco, CA +3 27 days ago $130,600$192,000
Apache Kafka Flink Spark Cassandra Clickhouse Iceberg DataHub Airflow Big Data infrastructure data governance CI/CD Python SQL Hadoop Docker Kubernetes AWS GCP Azure PostgreSQL Snowflake Git Jenkins