Senior Data Lakehouse Architect (Databricks), Vice President

State Street

Actively hiring
Quincy, MA · Boston, MA Posted 14 days ago $120,000$202,500 / year

At a glance

AI generated

TL;DR

As a Senior Data Lakehouse Architect at State Street, you will join the Legal function to design and implement a robust data lakehouse platform on AWS and Databricks, supporting legal operations, contract intelligence, eDiscovery, and AI/ML use cases. Your daily tasks include defining multi-layered data architecture for various legal datasets, leading ETL/ELT pipeline development with Databricks and Spark, ensuring compliance with security and regulatory standards, and enabling advanced analytics through integration with AI platforms. You will leverage technologies such as Delta Lake, Unity Catalog, AWS S3, Glue, EMR, and IAM to build scalable, secure, and compliant data solutions. This role requires expertise in regulated environments, experience with unstructured data processing, and a strong background in enterprise-scale data lakehouse architecture on Databricks and AWS.

Skills

Databricks AWS Delta Lake Unity Catalog Python SQL Spark S3 Glue EMR Lambda Redshift IAM KMS CI/CD Terraform NLP DevOps Grafana Prometheus Git LLM

What you'll do

  • Define and implement end-to-end Legal Data Lakehouse architecture using Databricks on AWS.
  • Design multi-layered data architecture to support various legal datasets including contracts and regulatory feeds.
  • Lead development of ETL/ELT pipelines for structured and unstructured legal data integration.
  • Implement data governance frameworks with Unity Catalog and AWS-native security controls.
  • Architect data models supporting contract analytics, clause extraction, and AI use cases.

What we're looking for

  • 10+ years of experience in data architecture, engineering, or analytics platforms.
  • Deep expertise in Databricks and AWS data platforms, including Delta Lake, Unity Catalog, and S3.
  • Proven track record of architecting enterprise-scale data lakehouse solutions in regulated environments.
  • Advanced proficiency in Apache Spark (PySpark/Scala), SQL, and Python for ETL/ELT pipelines.
  • Strong understanding of data governance, security, and compliance with experience in IAM and KMS.

Market check

Salary context

This $120,000–$202,500 range sits above 40% of similar postings on FindRole.

Peer median band

$138,720$213,480

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$145,575$217,725

Middle half of comparable postings.

Based on 239 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About State Street

State Street Corporation is one of the world''s largest custodian banks and asset managers, providing investment servicing, investment management, and investment research to institutional investors. Industry: Financial Services & Asset Custody

State Street currently has 133 open roles on FindRole.

Listed pay typically runs $110,000–$180,000 across 131 roles with salary data.

Most-posted roles

View all roles at State Street

More like this

Similar roles

Data Engineer (Databricks), Assistant Vice President

State Street

US 14 days ago $110,000$177,500
Databricks AWS PySpark Python Delta Lake CI/CD SQL Power Platform APIs Docker Hadoop S3 Glue Lambda IAM KMS Unity Catalog Power BI Power Apps Terraform PostgreSQL Oracle JSON Parquet

Databricks Tech Lead - Vice President

Citi

Remote (3800 Citigroup Center Drive Building F Tampa, US) 11 days ago $113,840$170,760
AWS Databricks Python SQL Terraform CloudFormation CI/CD S3 Glue Athena SQS Lambda Delta Lake Spark SQL
Remote

Data Architect, Snowflake, Vice President

Blackrock

US 75 days ago $155,000$210,000
Snowflake CI/CD dbt Fivetran Airflow AWS Azure GCP SQL Python DevOps Data质量管理工具 云生态系统 Snowflake Cortex LLM functions AI_SQL 向量数据库 语义搜索 RAG模式 责任AI框架 SOX控制 审计框架 数据保留政策 监管报告要求

Data Architect, Snowflake, Vice President

Blackrock

US 70 days ago $155,000$210,000
Snowflake SQL dbt Fivetran CI/CD Python Airflow AWS Azure GCP Terraform Kubernetes Prometheus Grafana Dimensional Modeling Cloud Ecosystems Snowpark AI_SQL Vector Databases RAG Responsible AI SOX Controls Data Lineage DevOps

Data Architecture Group Manager, Senior Vice President

Citi

Remote (6400 Las Colinas Blvd Irving, US) 21 days ago $156,160$234,240
AWS GCP Snowflake Databricks Kafka Raven ETL Data Warehousing Data Lakes Data Lakehouse Data Governance Data Security Spark Presto/Trino Hadoop API Driven Architecture CI/CD
Remote