Data, Lakehouse and AI Data Platform Engineer, Associate

Goldman Sachs

Quick summary

Work type: On-site
Location: Dallas, TX
Posted: 1 day ago
Nearby: 99+ roles within 25 mi

Market check

Salary context

How this pay compares to similar roles

Similar $189k

$148k most similar roles pay here $233k

This listing doesn't post a salary. Most similar roles pay $161,300–$217,600.

Based on 240 similar postings.

Employer

About Goldman Sachs

Goldman Sachs is a leading global investment banking, securities, and investment management firm providing financial services to corporations, financial institutions, governments, and individuals.

Goldman Sachs currently has 187 open roles on FindRole.

Listed pay typically runs $130,000–$250,000 across 60 roles with salary data.

Most-posted roles

View all roles at Goldman Sachs

At a glance

TL;DR · Data, Lakehouse and AI Data Platform Engineer, Associate

Role Posting Log in to save

As a Data Engineer in the Lakehouse and AI Data Platform team at Goldman Sachs, you will design, build, test, and support data pipelines and curated datasets on the firm’s modern data platform. Your responsibilities include developing robust data models, enhancing batch and streaming data pipelines using Python or Java, and ensuring data quality through validation and reconciliation processes. You will work with SQL, Apache Spark, Kafka, Snowflake, and other technologies to deliver scalable and reliable solutions that support analytics and operational decision-making. This role requires strong programming skills, familiarity with distributed data processing frameworks, and the ability to contribute to shared tooling for platform improvements. Ideal candidates are technically proficient, detail-oriented problem solvers who can work effectively in a fast-paced environment.

Skills

Python Java SQL Apache Spark Kafka JSON Avro Parquet Snowflake Apache Iceberg Databricks Hadoop CI/CD Kubernetes

What you'll do

Design, build, and support batch and streaming data pipelines on the Lakehouse and AI data platform.
Develop raw, refined, and curated datasets to support analytics, reporting, and AI use cases.
Implement controls for validating completeness, accuracy, and consistency of data across pipelines.
Work with consumers to shape data products that are usable, well-documented, and aligned to business needs.
Contribute to shared tooling or framework components to improve platform functionality and reliability.

What we're looking for

Bachelor’s or master’s degree in a relevant discipline with strong quantitative skills or data engineering expertise.
Strong hands-on programming experience in Python or Java, and good working knowledge of SQL.
Experience building or supporting production data pipelines using distributed data processing frameworks like Apache Spark.
Understanding of temporal data modelling, schema design, partitioning, clustering, and other performance techniques at scale.
Familiarity with software engineering fundamentals including version control, testing, release discipline, and CI/CD practices.
Ability to work closely with stakeholders and partner teams, contributing to shared tooling or platform improvements.

Similar roles

Data, Lakehouse and AI Data Platform Engineer, Associate

Goldman Sachs

New York, NY 1 day ago $115,000–$180,000

Python Java SQL Apache Spark Kafka JSON Avro Parquet Snowflake Apache Iceberg Databricks Hadoop CI/CD Kubernetes

Save

Vice President, Software Engineering - Data, Lakehouse and AI Data Platform

Goldman Sachs

Dallas, TX 1 day ago

Python Java SQL Apache Spark Kafka JSON Avro Parquet Snowflake Apache Iceberg Databricks Hadoop CI/CD Kubernetes

Save

Vice President, Software Engineering - Data, Lakehouse and AI Data Platform

Goldman Sachs

New York, NY 1 day ago $130,000–$250,000

Python Java SQL Apache Spark Kafka JSON Avro Parquet Snowflake Apache Iceberg Databricks Hadoop CI/CD Kubernetes

Save

Software Engineer, Data Platform

DoorDash, Inc

San Francisco, CA +3 27 days ago $130,600–$192,000

Apache Kafka Flink Spark Cassandra Clickhouse Iceberg DataHub Airflow Big Data infrastructure data governance CI/CD Python SQL Hadoop Docker Kubernetes AWS GCP Azure PostgreSQL Snowflake Git Jenkins

Save

Lead Data Engineer, Wealth R&D

State Street

Boston, MA 21 days ago $120,000–$202,500

Python AWS Azure Google Cloud Platform SQL NoSQL CI/CD Terraform Kubernetes Microservices Serverless Docker PostgreSQL Snowflake Apache Hadoop Apache Spark DataBricks Redis MongoDB Elasticsearch Kafka Airflow

Save

Software Engineer, Data Solutions, AI & Data Platforms

Apple Inc

Sunnyvale, CA 23 days ago $147,400–$272,100

Python Scala Java Snowflake BigQuery AWS Azure Google Cloud Spark Kafka Streamlit Superset Tableau Looker RESTful services ETL CI/CD

Save