Data Engineer

Equifax

Hybrid Actively hiring
Alpharetta, GA Posted 23 days ago

At a glance

AI generated

TL;DR

As a Senior Data Engineer on our dynamic data and analytics team, you will be responsible for building complex batch and streaming pipelines to ingest data from upstream cloud systems, designing scalable frameworks for data pipeline development, and leveraging AI-powered coding assistants to optimize code and generate documentation. You will also develop prompts for Large Language Models (LLMs) to assist in tasks like data cleansing and transformation logic generation, while maintaining robust data pipelines that support AI/ML applications. Key technologies include BigQuery, DataFlow, Pub/Sub, Cloud Functions, Airflow, Vertex AI, and Python with SQL proficiency. Experience with GCP big data environments, Airflow or Cloud Composer, and DevOps/CICD practices is essential, along with a strong background in data engineering principles and communication skills to engage both technical and non-technical stakeholders effectively.

Skills

GCP BigQuery DataFlow DataProc Pub/Sub Cloud Functions Airflow Vertex AI Python SQL Google Cloud Composer Terraform Jenkins GitHub LangChain LlamaIndex

What you'll do

  • Build complex batch and streaming pipelines to ingest data from cloud systems.
  • Design and implement frameworks for scaling development of data pipelines.
  • Use AI-powered coding assistants to accelerate and optimize code development.
  • Develop prompts for Large Language Models to assist in data-related tasks.
  • Design scalable data pipelines supporting AI/ML applications and automation.
  • Explore and implement AI agents to automate repetitive data management tasks.

What we're looking for

  • At least 5 years of experience in data engineering or related field.
  • Strong understanding of data engineering principles, including modeling and warehousing.
  • Experience building complex data pipelines using BigQuery, DataFlow, Pub/Sub, etc.
  • Proficiency in Python development and professional SQL skills.
  • Ability to communicate technical concepts effectively to various stakeholders.
  • At least 1 year working in a GCP big data environment.

Market check

Salary context

This listing doesn't show a salary. Similar roles on FindRole typically pay $99,000–$189,650.

Peer median band

$99,000$189,650

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$126,800$188,714

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Equifax

Equifax is a global data, analytics, and technology company and one of the "Big Three" credit reporting agencies, specializing in consumer and commercial credit information.

Equifax currently has 22 open roles on FindRole.

Most-posted roles

View all roles at Equifax

More like this

Similar roles

Data Engineer

Equifax

US 23 days ago
GCP BigQuery DataFlow DataProc Pub/Sub Cloud Functions Airflow Cloud Composer Vertex AI Python SQL Terraform Jenkins GitHub LangChain LlamaIndex CI/CD

Data Engineer

Booz Allen Hamilton

Locations Mclean, Virginia, US 9 days ago $77,600$176,000
AWS Python SQL Apache Spark GitLab CI/CD Terraform Amazon S3 AWS Glue Amazon Athena AWS Lambda Amazon Redshift Parquet DevSecOps APIs Event-driven integration Medallion architecture Data lake Distributed data processing

Data Engineer

Booz Allen Hamilton

Locations Fayetteville, North Carolina, US 25 days ago $77,500$176,000
Python AWS CI/CD SQL Terraform S3 IAM EventBridge StepFunctions Lambda dbt Airflow NoSQL GraphDatabases DataWarehousing Agile Waterfall Iterative Spiral ApacheIceberg ZeroTrust ABAC

Data Engineer

Booz Allen Hamilton

US 11 days ago $61,900$141,000
Python AWS ETL Git SQL Linux Terraform Apache Spark AWS EMR Redshift SageMaker Databricks CloudFormation CI/CD Prometheus Grafana

Data Engineer

Booz Allen Hamilton

Locations Alexandria, Virginia, US 63 days ago $62,000$141,000
Python SQL PySpark Apache Airflow Luigi Spark Databricks Hadoop Hive AWS EMR Kafka Shapefile GeoJSON KML GDAL Geopandas PostGIS Git

Data Engineer

Q2

Cary, North Carolina, US 37 days ago
Python SQL Snowflake Apache Airflow dbt Kafka Terraform Kubernetes Docker Git CI/CD PostgreSQL AWS Glue Pyspark Databricks SageMaker