Lead Software Engineer, Databricks, Spark, AWS

JPMorgan Chase

Quick summary

Work type
On-site
Location
Plano, TXJersey City, NJ
Salary
$152,000–$215,000 / yr
Posted
5 days ago

Market check

Salary context

Below market

How this pay compares to similar roles

Similar $200k
This role $184k
$143k most similar roles pay here $234k

This role pays less than 71% of similar roles. Most pay $183,500–$215,500 — the shaded band above. At the midpoint, this role pays about $184k versus about $200k for comparable roles.

Based on 240 similar postings.

Employer

About JPMorgan Chase

JPMorgan Chase & Co. is a global financial services firm and one of the largest banks in the world, offering investment banking, commercial banking, asset management, and consumer financial services.

JPMorgan Chase currently has 436 open roles on FindRole.

Listed pay typically runs $152,000–$215,000 across 230 roles with salary data.

Most-posted roles

View all roles at JPMorgan Chase

At a glance

TL;DR · Lead Software Engineer, Databricks, Spark, AWS

As a Lead Software Engineer at JPMorgan Chase within the Corporate Sector's Chief Technology Office, you will lead an agile team responsible for enhancing and delivering high-throughput, low-latency data pipelines using Databricks and Apache Spark. Your daily tasks include establishing lakehouse patterns with Delta Lake, orchestrating jobs with Databricks Workflows, and integrating with AWS services such as S3, Glue, IAM, CloudWatch, Lambda, Kinesis, and Kafka for secure data ingestion and transformation. You will also drive performance engineering initiatives to optimize cost and ensure data quality through governance tools like Unity Catalog. Proficiency in Python or Java, hands-on experience with Databricks, and solid AWS expertise are essential, along with a strong background in Spark performance tuning and ETL/ELT pipeline architecture. This role is pivotal in supporting the financial institution's technology needs within a highly regulated environment.

What you'll do

  • Lead the architecture and delivery of high-throughput data pipelines using Databricks and Apache Spark.
  • Establish lakehouse patterns with Delta Lake to ensure performance at scale in a secure environment.
  • Own Databricks cluster strategy, including runtime selection, autoscaling, and Spark configuration optimization.
  • Design secure data ingestion frameworks leveraging AWS services like S3, Glue, IAM, and CloudWatch.
  • Drive Spark performance engineering by optimizing partitioning strategies, file sizing, and memory control.

What we're looking for

  • 10+ years of professional software/data engineering experience with substantial production work in Spark on Databricks or EMR.
  • Strong proficiency in Python and/or Java for data processing, platform tooling, and automation.
  • Hands-on expertise with Databricks (Delta Lake, Unity Catalog, Workflows, Repos/notebooks).
  • Solid AWS experience including S3, IAM, Glue, CloudWatch, Kinesis/MSK, DynamoDB.
  • Proven track record in architecting and operating ETL/ELT pipelines with schema design/evolution and reliability engineering.
  • Deep skills in Spark performance tuning and Databricks cluster setup/optimization.
  • Strong SQL and analytics data modeling expertise (dimensional/star schema; lakehouse best practices).

More like this

Similar roles

Lead Software Engineer, Python, Databricks, AWS

JPMorgan Chase

Glasgow, Scotland, United Kingdom 1 day ago
Python PySpark SQL Databricks AWS S3 ECS SNS/SQS Lambda CI/CD Jenkins Airflow Parquet JSON CSV Avro Delta Lake GitHub Copilot Agile Cloud-based data warehouses

Lead Software Engineer, Java/Python, AWS, Spark

JPMorgan Chase

Pune, MH, India 3 days ago
Python Java Spark AWS Terraform Docker Kubernetes CI/CD Apache Airflow Snowflake SQL NoSQL JSON AVRO Parquet Microservices Serverless Test-Driven Development Behavior-Driven Development Infrastructure as Code Kafka Spinnaker

Software Engineer II, Platform Engineer/Databricks

JPMorgan Chase

Jersey City, NJ 19 days ago $118,750$150,000
AWS Databricks Python Java GitHub Bitbucket Jenkins maven Terraform Spark CI/CD Responsible AI Big Data Platform Administration Monitoring Resiliency

Data Lead Software Engineer

JPMorgan Chase

New York, NY +1 3 days ago $152,000$215,000
AWS Python Pyspark Lambda S3 Glue Step Functions Airflow Snowflake Databricks CI/CD Git Jenkins Maven SQL NoSQL Agile