Staff Data Engineer (Audio/ML)

The Walt Disney Company

Actively hiring
Remote (USA - CA - Skywalker Sound-Nicasio, US) Posted 43 days ago $170,500–$228,600 / year

At a glance

AI generated

TL;DR

The Skywalker Sound Development Group is seeking a Staff Data Engineer with expertise in Audio/ML to develop and maintain scalable data pipelines for AI/ML research focused on immersive audio applications. This role involves designing robust systems for processing large-scale audio datasets, ensuring efficient model training workflows, and collaborating closely with researchers to refine data requirements. Key responsibilities include automating data cleaning, normalization, and augmentation processes, integrating external datasets while adhering to legal standards, and creating tools for dataset annotation and curation. The ideal candidate holds a Master’s or PhD in Data Engineering/Science, Computer Science, or Signal Processing, with extensive experience in building pipelines for AI/ML applications using Python, Pandas, NumPy, PyTorch, Librosa, FFmpeg, GitLab, Apache Spark, Airflow, and Kubernetes. Knowledge of cloud platforms like AWS S3 and Redshift is essential, as is a strong understanding of data pipeline requirements for iterative research workflows in the audio domain.

Skills

Python, Pandas, NumPy, PyTorch, Librosa, FFmpeg, SoX, Apache Spark, Airflow, Kubernetes, Docker, AWS S3, Redshift, Google BigQuery, GitLab CI/CD, Tableau, Matplotlib

What you'll do

  • Design and maintain scalable data pipelines for large-scale audio datasets.
  • Develop preprocessing techniques for immersive and multichannel audio formats.
  • Automate data cleaning, normalization, and augmentation processes for AI/ML models.
  • Integrate external datasets and APIs while ensuring compliance with legal standards.
  • Monitor and optimize pipeline performance to handle complex data structures.
  • Create tools for annotating and curating datasets using active learning methods.
  • Perform exploratory data analysis to validate dataset quality and identify gaps.

What we're looking for

  • 8+ years of experience in data engineering with a focus on AI/ML pipelines.
  • Master’s degree or PhD in Data Science, Computer Science, Signal Processing, or a related field.
  • Proficiency in Python, with expertise in Pandas, NumPy, and PyTorch for data manipulation.
  • Hands-on experience with audio processing tools such as Librosa, FFmpeg, and SoX.
  • Familiarity with scalable pipeline tools such as Apache Spark, Airflow, and Kubernetes.
  • Strong understanding of cloud-based platforms for storage and processing (AWS S3, Redshift).
  • Experience with immersive and multichannel audio formats.

Market check

Salary context

This $170,500–$228,600 range sits above 64% of similar postings on FindRole.

Peer median band

$132,000–$225,000

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$129,987–$214,500

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About The Walt Disney Company

The Walt Disney Company is a diversified global entertainment and media enterprise operating in segments including Disney Parks, Experiences and Products; Entertainment (ABC, Hulu, Disney+); and ESPN.

Industry: Entertainment & Media

The Walt Disney Company currently has 121 open roles on FindRole.

Listed pay typically runs $141,900–$190,300 across 113 roles with salary data.

More like this

Similar roles

Data Engineer, Staff

Qualcomm

San Diego, CA, US 16 days ago $132,000–$198,000
Databricks AWS Python Delta Lake Unity Catalog SQL NoSQL CI/CD Kafka Spark Hadoop Terraform Git Jenkins Prometheus Grafana Snowflake Redshift PostgreSQL MongoDB Kubernetes Docker Airflow Vault Fivetran HVR Data Lineage Tools AI/ML Platforms

Data Engineer, Senior Staff

Qualcomm

San Diego, CA, US 16 days ago $158,400–$237,600
Databricks AWS Python Delta Lake Unity Catalog CI/CD SQL NoSQL Data Structures AI Machine Learning Kafka Hadoop Spark Terraform Git Jenkins Prometheus Grafana Kubernetes Vault Fivetran HVR

Staff Data Engineer

Blackline

New York, New York, US 30 days ago $193,000
PySpark AWS Azure Google Cloud FiveTran Plaid BlackLine CI/CD Python ETL Kafka PostgreSQL Snowflake Docker Git Terraform Prometheus Grafana BigQuery Hadoop SparkSQL

Staff Data Engineer

PayPal

USA - California - San Jose - Corp - N First St, US 59 days ago $159,500–$236,500
SQL Python dbt Airflow Terraform CI/CD PostgreSQL Kafka Docker Prometheus Grafana Git Jenkins AWS Azure Google Cloud Platform Snowflake BigQuery Data Quality Tools Schema Drift Detection Tools

Staff Data Engineer

PayPal

USA - California - San Jose - Corp - N First St, US 63 days ago $159,500–$236,500
SQL Python dbt Airflow Terraform CI/CD PostgreSQL Kafka Docker Prometheus Grafana Git Jenkins Snowflake BigQuery AWS Azure Google Cloud Platform Data Quality Event Instrumentation Schema Management Data Contracts A/B Testing Product Analytics Full Stack Data Engineering

Staff Data Engineer

Samsung Electronics

645 Clyde Avenue, Mountain View, CA, US 85 days ago $190,000
Python Java Scala Kubernetes Apache_Flink Apache_Ignite Hadoop Spark SQL Airflow RESTful_API Golang Snowflake MapReduce