Senior Data Engineer - Vice President

Citi

Remote Actively hiring
Remote, USA · Irving, TX Posted 18 days ago $125,760$188,640 / year

At a glance

AI generated

TL;DR

Citi is hiring a Senior Data Engineer to join its dynamic technology team, focusing on designing and building scalable data infrastructure on cloud platforms. This role involves creating ETL/ELT pipelines using PySpark, Spark SQL, and Delta Lake on Databricks, managing cloud-native services for storage and analytics, and optimizing performance through tuning and autoscaling. The candidate will also work on advanced AI projects like Retrieval-Augmented Generation (RAG) and Agentic AI systems, requiring proficiency in Python, Databricks, Snowflake, Starburst/Trino, and Apache Iceberg. Additionally, the role demands expertise in Agile methodologies, leadership skills to guide project teams, and strong client interaction abilities to manage stakeholder expectations effectively. The ideal candidate has 6-10 years of experience in large-scale enterprise data engineering, particularly within financial services, with a focus on delivering robust, secure, and high-quality data solutions at scale.

Skills

Python PySpark Databricks Snowflake Starburst Trino Apache Iceberg AWS Agile Kubernetes Docker CI/CD Prometheus Grafana

What you'll do

  • Design and build scalable ETL/ELT pipelines using PySpark and Delta Lake on Databricks.
  • Manage data solutions on cloud platforms, implementing storage, processing, and analytics services.
  • Optimize Spark workloads and clusters for performance and cost efficiency.
  • Implement and manage Lakehouse architecture with Delta Lake to ensure high-quality data governance.
  • Lead the design of Starburst-based data solutions, ensuring scalability and reliability.
  • Develop robust data pipelines focusing on data quality, lineage, and compliance.

What we're looking for

  • Extensive experience in designing and managing large-scale data pipelines using PySpark, Spark SQL, and Delta Lake on Databricks.
  • Proficiency in cloud-native services for data storage, processing, and analytics across AWS, GCP, or Azure platforms.
  • Expertise in optimizing big data workloads, including performance tuning and cost management of Databricks clusters.
  • Strong background in implementing and managing Lakehouse architecture with Delta Lake and Unity Catalog for robust data governance.
  • Leadership skills to guide projects using Agile methodologies, ensuring timely delivery and alignment with organizational goals.
  • Hands-on experience with Snowflake, Starburst/Trino, Apache Iceberg, and federated query engines for unified data access.
  • Demonstrable ability to mentor junior engineers and promote best practices in a dynamic technology team.

Market check

Salary context

This $125,760–$188,640 range sits above 46% of similar postings on FindRole.

Peer median band

$125,760$200,000

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$140,418$195,200

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Citi

Citi is one of the world’s most trusted financial institutions, proudly serving millions of customers across the United States.

Citi currently has 336 open roles on FindRole.

Listed pay typically runs $125,760–$188,640 across 308 roles with salary data.

Most-posted roles

View all roles at Citi

More like this

Similar roles

Senior Data Engineer - Vice President

Citi

Remote (6400 Las Colinas Blvd Irving, US) 18 days ago $125,760$188,640
Python PySpark Databricks Snowflake Starburst Trino Apache Iceberg AWS Agile Kubernetes Docker CI/CD Prometheus Grafana
Remote

Sr. Data Engineer - Assistant Vice President

Citi

Remote (6460 Las Colinas Blvd Irving, US) 10 days ago
Hadoop Spark Kafka Hive Parquet Avro Python Scala Java Databricks Microservices AI ML Deep Learning NLP SQL Docker Kubernetes Data Mesh Starburst
Remote

SR. Data Engineer - Assistant Vice President

Citi

Remote (6460 Las Colinas Blvd Irving, US) 10 days ago
Hadoop Spark Kafka Hive Python Scala Java Databricks ETL ELT Microservices AI ML DeepLearning NLP SQL Docker Kubernetes AWS Azure GCP DataMesh Starburst
Remote

Senior Data Engineer Lead - Senior Vice President

Citi

Remote (480 Washington Boulevard Jersey City, US) 20 days ago $176,720$265,080
Python Scala SQL AWS GCP Azure Hadoop Hive Impala Apache_Spark Kafka Flink Airflow Dagster Prefect PostgreSQL Oracle MongoDB Cassandra CI/CD Jenkins GitLab_CI GitHub_Actions
Remote

Data Engineer, Senior

Booz Allen Hamilton

Locations Huntsville, Alabama, US 28 days ago $77,500$176,000
SQL Python Scala Spark Java AWS EMR AWS Glue Azure Data Factory Power Apps Apache Spark Apache NiFi AirFlow Databricks Snowflake Redshift BigQuery Elasticsearch Solr MongoDB Cosmos DB Jenkins GitHub NIST 800.53 FISMA CI/CD

Data Engineer - Assistant Vice President

Citi

Remote (3800 Citigroup Center Drive Building C Tampa, US) 23 days ago $96,960$145,440
Python Java Scala Hadoop Snowflake Databricks SQL Kubernetes Spark Kafka Airflow Terraform AWS Google Cloud
Remote