Applications Development Senior Programmer Analyst - Assistant Vice President

Citi

Remote Actively hiring
Remote, USA · Irving, TX Posted 28 days ago $107,120$160,680 / year

At a glance

AI generated

TL;DR

Seeking a Data Engineer with 4-7 years of experience to join our Applications Development team in Irving, Texas. The role involves developing and maintaining scalable ETL/ELT pipelines using PySpark, Spark SQL, and Delta Lake on Databricks, supporting data platforms across AWS, Azure, or GCP, and contributing to Lakehouse architecture implementation. Key responsibilities include optimizing Spark workloads, implementing data federation solutions with Starburst/Trino, and ensuring compliance with data security policies. The ideal candidate will have hands-on experience with Python, PySpark, Databricks, Snowflake, Apache Iceberg, and cloud platforms, as well as a strong understanding of data governance frameworks and Agile methodologies. This role requires expertise in building robust data pipelines for analytics and machine learning workloads while collaborating closely with data scientists to enable AI/ML use cases.

Skills

PySpark Spark SQL Databricks Delta Lake AWS Azure GCP Snowflake Starburst/Trino Apache Iceberg Python Agile CI/CD RAG-based use cases Data Governance RBAC Ab Initio

What you'll do

  • Develop and maintain scalable ETL/ELT pipelines using PySpark and Delta Lake on Databricks.
  • Optimize Spark workloads through performance tuning and partitioning strategies.
  • Implement data federation solutions using Starburst/Trino across multiple sources.
  • Support data governance initiatives, including metadata management and adherence to standards.
  • Ensure compliance with data security policies, including access controls and auditability.
  • Collaborate with data scientists to enable pipelines for AI/ML use cases.
  • Work on Lakehouse architecture implementation, ensuring data quality and reliability.

What we're looking for

  • 4-7 years of experience in Data Engineering or related roles.
  • Expertise in PySpark, Spark SQL, and Databricks Delta Lake for ETL/ELT pipelines.
  • Experience with AWS, Azure, or GCP data services and cloud platforms.
  • Knowledge of Snowflake, Starburst/Trino, and Apache Iceberg for big data technologies.
  • Understanding of data governance frameworks and compliance policies.
  • Hands-on experience supporting data pipelines for analytics and machine learning.

Market check

Salary context

This $107,120–$160,680 range sits above 18% of similar postings on FindRole.

Peer median band

$117,000$198,000

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$135,000$177,900

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Citi

Citi is one of the world’s most trusted financial institutions, proudly serving millions of customers across the United States.

Citi currently has 336 open roles on FindRole.

Listed pay typically runs $125,760–$188,640 across 308 roles with salary data.

Most-posted roles

View all roles at Citi

More like this

Similar roles