Senior Data Engineer - AI & Analytics Infrastructure

IBM

Quick summary

Work type
On-site
Location
New York, NY
Posted
6 days ago

Market check

Salary context

How this pay compares to similar roles

Similar $185k
$132k most similar roles pay here $237k

This listing doesn't post a salary. Most similar roles pay $142,400–$226,650.

Based on 240 similar postings.

Employer

About IBM

IBM is a US-based global technology company providing hybrid cloud, AI, consulting, enterprise software, and IT infrastructure products and services.

IBM currently has 743 open roles on FindRole.

Listed pay typically runs $1,000,000–$1,000,000 across 8 roles with salary data.

Most-posted roles

View all roles at IBM

At a glance

TL;DR · Senior Data Engineer - AI & Analytics Infrastructure

As a Senior Data Engineer at our firm in New York, you will join the AI & Analytics Infrastructure team to design and scale data pipelines for a critical Agentic AI engagement. Your day-to-day responsibilities include developing robust data pipelines using Microsoft Fabric, Databricks, or Azure Synapse Analytics, ensuring they are performant and reliable for enterprise-scale operations. You will also architect and maintain data infrastructure, implement data lakehouse patterns, and collaborate with AI engineers to ensure structured data outputs. Additionally, you will establish data governance standards, enforce validation frameworks, and optimize storage environments for cost efficiency. Ideal candidates have extensive experience in Azure Data Factory, AWS Glue, Databricks, and Snowflake, along with a background in MLOps and feature engineering for AI models. This role demands expertise in cloud-native deployment practices and secure enterprise data operations across Azure and AWS platforms.

What you'll do

  • Design, build, and maintain robust data pipelines using Microsoft Fabric, Databricks, and Azure Synapse Analytics.
  • Develop scalable architectures to ensure data pipelines are performant and reliable for enterprise use.
  • Define and implement data lakehouse patterns and medallion architecture for AI and analytics use cases.
  • Implement data quality checks and validation frameworks to ensure trustworthy data outputs.
  • Establish and enforce data governance standards including lineage tracking, cataloging, and access controls.

What we're looking for

  • 7+ years of experience designing and maintaining scalable batch and real-time data pipelines on Azure and AWS.
  • Expertise in building and optimizing enterprise data platforms using Azure Data Factory, Azure Data Lake, AWS S3, AWS Glue, Databricks, and Snowflake.
  • Development of robust ETL/ELT frameworks for analytics, reporting, operational, and AI/ML use cases across cloud and hybrid ecosystems.
  • Implementation of scalable ingestion and transformation pipelines for structured, semi-structured, and unstructured data sources.
  • Support for data industrialization through reusable pipeline frameworks, standardized engineering practices, observability, monitoring, automated testing, and CI/CD deployment patterns.
  • Enablement of trusted enterprise data foundations by implementing data quality controls, metadata management, lineage tracking, cataloging, and governance capabilities.

More like this

Similar roles

Senior Data Engineer - AI & Analytics Infrastructure

IBM

Chicago, IL 6 days ago
Azure AWS Databricks Azure Synapse Analytics Microsoft Fabric Snowflake Azure Data Factory Azure Data Lake AWS S3 AWS Glue ETL ELT CI/CD Data Governance Metadata Management Event Hubs MLOps Feature Engineering Infrastructure-as-Code

Senior Data Engineer - AI & Analytics Infrastructure

IBM

Dallas, TX 6 days ago
Azure AWS Databricks Azure Synapse Analytics Snowflake Microsoft Fabric Azure Data Factory Azure Data Lake AWS S3 AWS Glue ETL ELT CI/CD Data Governance MLOps Event Hubs Metadata Management Observability Monitoring Python SQL

Senior Data Engineer - AI and Analytics

CVS Health

Remote (Buffalo Grove-2100 E Lake Cook, US) 29 days ago $101,970$203,940
Python SQL NoSQL ETL ELT Kubernetes GCP AWS Azure Big data Cloud architecture Data warehouses Reporting tools Query optimization Metadata management Workload management Real-time streaming Spark Streaming CI/CD Git
Remote

Senior IT AI/ML Engineer - Data Analytics

Palo Alto Networks

Santa Clara, CA 3 days ago
Python TensorFlow PyTorch scikit-learn R AWS Azure Google Cloud Platform MLOps Kubernetes Docker CI/CD SQL C++ Apache Kafka Prometheus Grafana Git GitHub

Senior AI Platform Engineer- Data and Systems

Adobe

San Jose 39 days ago $208,300$301,600
Apache_Spark Databricks Delta_Lake Kafka Kinesis Flink Python Scala SQL AWS Azure Docker Kubernetes CI/CD MCP LangChain LLMs Feature_Stores RAG Unity_Catalog FAISS Pinecone Weaviate Semantic_layers DataHub OpenMetadata AI-powered_developer_tools

Senior Analytics Engineer, People Data

Anduril Industries

Boston, Massachusetts 2 days ago $166,000$220,000
SQL Python Snowflake Google BigQuery AWS Redshift Databricks Delta Lake dbt Apache Airflow Flyte CI/CD Terraform Kubernetes Tableau Power BI Looker Apache Iceberg Palantir Foundry Rippling API Workday API Oracle HCM Cloud API