Sr. Data Delivery Specialist (Public and purchased data collection)

Genentech

Hybrid Actively hiring
South San Francisco Posted 17 days ago $127,800$237,300 / year

At a glance

AI generated

TL;DR

Roche’s Research and Early Development (gRED and pRED) organizations are seeking an entry-level Associate Data Delivery Specialist to join the Data Capability team within the Data and Digital Catalyst organization. This role involves coordinating, preparing, and delivering high-dimensional datasets from external partnerships and public data collections, ensuring they are accessible and well-documented for use in research and AI/ML workflows. Responsibilities include supporting real-world data requests, managing interactions with external partners, enforcing data governance policies, handling sequencing and imaging data, performing quality checks, and collaborating across internal teams to integrate diverse scientific datasets. The ideal candidate has a background in Data Science, Bioinformatics, or related fields, experience with RWD sources, proficiency in Python (Pandas), SQL, and cloud environments like AWS S3, and familiarity with Jupyter notebooks and workflow tools. Knowledge of FAIR data principles and AI/ML workflows is preferred.

Skills

Python Pandas SQL Bash CSV JSON Parquet AWS S3 GCS Azure Jupyter notebooks CI/CD FAIR data principles MLOps

What you'll do

  • Support intake and fulfillment of real-world data requests, ensuring datasets are complete and well-documented.
  • Coordinate with external partners to manage data requests, query submissions, and returns efficiently.
  • Assist in managing data access workflows, ensuring compliance with usage agreements and tracking data usage.
  • Handle high-dimensional sequencing, imaging, and proteomics datasets for standardized formatting and validation.
  • Perform quality checks and metadata validation to ensure datasets are ready for analysis and troubleshooting issues.
  • Contribute to early-stage AI-enabled data curation efforts to improve scalability in data delivery workflows.

What we're looking for

  • PhD or Master’s degree with relevant experience in data science, bioinformatics, health informatics, biomedical engineering, or computer science.
  • Experience working with real-world data (RWD), clinical data, and biomedical datasets.
  • Strong attention to detail and commitment to ensuring high-quality, reliable data delivery.
  • Proficiency in Python (Pandas) or SQL; familiarity with Bash is beneficial.
  • Knowledge of structured data formats (CSV, JSON, Parquet) and exposure to scientific data formats.
  • Familiarity with cloud environments such as AWS S3, GCS, or Azure.

Market check

Salary context

This $127,800–$237,300 range sits above 72% of similar postings on FindRole.

Peer median band

$111,950$198,889

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$126,800$190,587

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Genentech

Genentech is a leading research-driven company dedicated to discovering and developing, manufacturing, and commercializing medicines for people with serious and life-threatening diseases.

Genentech currently has 8 open roles on FindRole.

Listed pay typically runs $170,700–$317,050 across 8 roles with salary data.

Most-posted roles

View all roles at Genentech

More like this

Similar roles

Data Specialist

Caterpillar

East Peoria, Illinois, US 62 days ago $89,210$133,810
Snowflake Python PowerBi Excel ETL Cloud computing solutions IaaS SaaS Serverless computing SQL Agile development methodology PowerPoint CI/CD Databases Data warehouse ETL process Supply chain management Inventory models BOM structures Mechanical engineering drawings

Sr Data Governance, Data Quality Analyst

Pacific Life

Newport Beach Ca-700, US 30 days ago $137,610$168,190
Collibra AWS Snowflake Apache Spark Kafka Git CI/CD Docker SQL Data quality tools Cloud computing software ETL technologies ML based observability tools Alation Atlan

Sr Data Engineer

The Walt Disney Company

Remote (Usa - Ca - 1200 Grand Central Ave, US) 92 days ago $138,900$186,200
Python Java SQL Kafka Flink Spark Kinesis Pinecone Weaviate FAISS pgvector Airflow Dagster AWS S3 Glue MWAA Prometheus Datadog Databricks LangChain
Remote

Sr Data Engineer

The Walt Disney Company

Remote (Usa - Ny - 7 Hudson Square, US) 46 days ago $148,700$199,400
Scala Python Spark Databricks Airflow Snowflake GitHub Actions Jenkins AWS S3 Kafka CI/CD SQL Bash PowerShell GraphQL Redshift BigQuery Scrum Agile
Remote

Sr Data Engineer

The Walt Disney Company

Remote (Usa - Ca - Market St, US) 15 days ago $138,900$186,200
Python Java SQL Kafka Flink Spark Kinesis Pinecone Weaviate FAISS pgvector Airflow Dagster AWS S3 Glue MWAA Datadog Prometheus Databricks LangChain
Remote

Sr Data Engineer

The Walt Disney Company

Remote (Usa - Ny - 7 Hudson Square, US) 11 days ago $148,700$199,400
Scala Python Spark Airflow Databricks Delta Lake Snowflake AWS S3 SQL CI/CD Agile Scrum
Remote