IAM IGA Data Engineer, Assistant Vice President

State Street

Hybrid

Quick summary

Work type: Hybrid
Location: Princeton, NJ; Stamford, CT; Jersey City, NJ; Boston, MA; Atlanta, GA
Salary: $100,000–$167,500 / yr
Posted: 2 days ago
Closes: Aug 28, 2026

Market check

Salary context

Below market

How this pay compares to similar roles

[Chart: pay scale from $87k to $223k, with a shaded band where most similar roles pay. Similar roles midpoint: $175k. This role midpoint: $134k.]

This role pays less than 89% of similar roles. Most pay $152,903–$198,000 — the shaded band above. At the midpoint, this role pays about $134k versus about $175k for comparable roles.

Based on 240 similar postings.

Employer

About State Street

State Street Corporation is one of the world's largest custodian banks and asset managers, providing investment servicing, investment management, and investment research to institutional investors.

Industry: Financial Services & Asset Custody

State Street currently has 168 open roles on FindRole.

Listed pay typically runs $120,000–$188,750 across 166 roles with salary data.

At a glance

TL;DR · IAM IGA Data Engineer, Assistant Vice President

State Street’s cyber architecture & engineering team is seeking a Data Engineer to design and implement data models across relational, graph, and lakehouse systems using AWS RDS, AWS Neptune, and Databricks. This role involves building GraphRAG pipelines for intelligent search and LLM-based applications, collaborating with senior engineers and AI teams to ensure scalable and secure solutions. Key responsibilities include designing schemas in AWS RDS and Neptune, developing warehouse/lakehouse models in Databricks Delta Lake, implementing ETL/ELT workflows using PySpark, Airflow, or AWS Glue, and optimizing data storage and retrieval processes. The ideal candidate has 3-5 years of experience in data engineering with strong SQL skills, proficiency in AWS Neptune and graph modeling, and familiarity with GraphRAG concepts and vector search. Additional preferred skills include knowledge of LangChain or LlamaIndex for RAG workflows and exposure to OpenSearch, FAISS, or Databricks Vector Search.

What you'll do

  • Design relational schemas in AWS RDS and graph models in AWS Neptune/Neo4j.
  • Develop warehouse/lakehouse schemas in Databricks Delta Lake for analytics and ML.
  • Implement graph-based retrieval and integrate with vector search for AI use cases.
  • Build ETL/ELT workflows using Databricks (PySpark), Airflow, or AWS Glue.
  • Tune queries and optimize storage for Neptune, RDS, and Delta Lake.

What we're looking for

  • 3-5 years of experience in data engineering roles.
  • Strong SQL and schema design for AWS RDS (PostgreSQL/MySQL).
  • Hands-on experience with AWS Neptune (Gremlin/SPARQL) and graph modeling.
  • Proficiency in Databricks, PySpark, and Delta Lake.
  • Familiarity with GraphRAG concepts, embeddings, and vector search.

More like this

Similar roles

Data Engineer, Databricks

State Street

Quincy, MA +3 · 44 days ago · $110,000–$177,500
Databricks, AWS, PySpark, Python, Delta Lake, CI/CD, SQL, Power Platform, APIs, Docker, Hadoop, S3, Glue, Lambda, IAM, KMS, Unity Catalog, Power BI, Power Apps, Terraform, PostgreSQL, Oracle, JSON, Parquet

Senior Data Engineer, Assistant Vice President

Citi

Remote (Irving, TX) · 40 days ago
Hadoop, Spark, Kafka, Hive, Python, Scala, Java, Databricks, ETL, ELT, Microservices, AI, ML, DeepLearning, NLP, SQL, Docker, Kubernetes, AWS, Azure, GCP, DataMesh, Starburst
Remote

Senior Data Engineer, Vice President

Citi

Remote (Irving, TX) · 48 days ago · $125,760–$188,640
Python, PySpark, Databricks, Snowflake, Starburst, Trino, Apache Iceberg, AWS, Agile, Kubernetes, Docker, CI/CD, Prometheus, Grafana
Remote

Senior Data Engineer, Vice President

Citi

Remote (Irving, TX) · 48 days ago · $125,760–$188,640
Python, PySpark, Databricks, Snowflake, Starburst, Trino, Apache Iceberg, AWS, Agile, Kubernetes, Docker, CI/CD, Prometheus, Grafana
Remote

Big Data Support Engineer, Assistant Vice President

Citi

Texas · 51 days ago
Shell, Perl, C, C++, .NET, Java, HTML5, HDFS, Hive, Impala, Spark, YARN, Sentry, Oozie, Kafka, SQL, Oracle, Sybase, Python, UNIX, Linux, Windows, Cloudera, CI/CD, ITIL, DevOps, AWS, Azure, GCP, Docker, Kubernetes, Terraform, Prometheus, Grafana, Jenkins, Git, GitHub, Bitbucket, Ansible, Chef, Puppet, Nagios, Zabbix, ELK Stack, Splunk, PostgreSQL, MSSQL, Redis, MongoDB, SRE