Data Engineer - Financial Crimes - Associate

Morgan Stanley

Quick summary

Work type: On-site
Location: New York, NY
Salary: $90,000–$150,000 / yr
Posted: 1 day ago

Market check

Salary context

Below market

How this pay compares to similar roles

[Chart: pay scale from $77k to $216k, marking this role at $120k, similar roles at $164k, and a shaded band where most similar roles pay.]

This role pays less than 89% of similar roles, most of which pay $126,800–$202,165. At the midpoint, this role pays about $120k versus about $164k for comparable roles.

Based on 240 similar postings.

Employer

About Morgan Stanley

Morgan Stanley is a global financial services firm providing investment banking, securities, wealth management, and investment management services to corporations, governments, institutions, and individuals.

Industry: Investment Banking & Financial Services

Morgan Stanley currently has 39 open roles on FindRole.

Listed pay typically runs $140,000–$165,000 across 37 roles with salary data.

At a glance

TL;DR · Data Engineer - Financial Crimes - Associate

As an Associate Data Engineer in Financial Crimes at Morgan Stanley, you will join a team responsible for developing and maintaining PySpark-based ETL frameworks in distributed cluster environments. Day-to-day responsibilities include designing data models from business requirements, performing data analysis, troubleshooting data issues, and providing Level-3 production support. You will work with relational databases, big data stores, and AI-enabled tools to improve data quality and performance. The ideal candidate has at least five years of technology experience in the financial industry, strong SQL skills, and ideally hands-on experience with Power Designer, Autosys job definitions, and graph databases such as Stardog. Familiarity with Linux shell scripting and Java development is preferred, along with an understanding of data quality rules. The role contributes to Morgan Stanley’s global mission to serve clients across 40 countries.

What you'll do

  • Design and implement PySpark-based ETL frameworks with error handling in distributed environments.
  • Perform data analysis and sourcing in relational and big data database environments.
  • Translate business requirements into data models and program design specifications.
  • Modify ETL programs to adapt to evolving business needs.
  • Troubleshoot data issues and provide Level-3 production support as needed.
  • Use AI-enabled tools to improve data mapping, lineage creation, and profiling.

What we're looking for

  • At least 5 years of financial industry technology experience with knowledge of financial services data.
  • Expertise in PySpark, Python, SQL, and relational databases for ETL development.
  • Strong analytical skills and proficiency in troubleshooting complex data issues.
  • Experience in developing robust data models and program design specifications.
  • Hands-on experience with graph database technologies like Stardog (desired).
  • Familiarity with Linux shell scripting and Java development (preferred).
  • Ability to provide Level-3 production support and optimize query performance.

More like this

Similar roles

Data Engineer, Associate

BlackRock

New York · 50 days ago · $132,500–$162,000
Python, Scala, Spark, Hadoop, PySpark, Hive, Snowflake, Transact-SQL, NoSQL, GraphQL, Great Expectations, Swagger, OpenAPI, AWS, Azure, Docker, Kafka, Kubernetes, Jenkins, GitLab CI, Axon, Unity Catalog, Databricks, Airflow, DBT
Hybrid

Lead Data Engineer (Finance Tech)

Capital One Financial

Richmond, VA · 9 days ago · $197,300–$225,100
Python, SQL, AWS, Kubernetes, Terraform, CI/CD, Docker, PostgreSQL, Snowflake, Apache Airflow, Git, Jenkins, Ansible, Prometheus, Grafana

Data Engineer, Operations Finance

Apple Inc

Sunnyvale, CA · 38 days ago · $146,300–$244,100
Dataiku, Python, SQL, Airflow, Jenkins, Snowflake, PostgreSQL, CI/CD, Docker, Kubernetes, AWS, GCP, Azure, Prometheus, Grafana, Git, Terraform

Data Engineer - Senior Associate

PwC

Atlanta · 10 days ago · $77,000–$202,000
AWS, Azure, GCP, Snowflake, Databricks, AWS Glue, AWS Lambda, Azure Data Factory, Azure Functions, GCP Functions, GCP Dataproc, GCP Dataflow, ETL, ELT, CI/CD, SQL, Python, Kubernetes