Principal Data Scientist, R&D Oncology

Johnson & Johnson

Actively hiring
Spring House, PA · Cambridge · Raritan, NJ · Titusville, NJ · San Diego, CA Posted 92 days ago $117,000$201,250 / year

At a glance

AI generated

TL;DR

Johnson & Johnson Innovative Medicine is seeking a Principal Data Scientist for its R&D Oncology team, focusing on advancing data capture and workflow optimization. This role involves designing and implementing engineering requirements to support diverse data sources in oncology research and development, including clinical, pre-clinical, real-world data, and ‘omics platforms. The candidate will collaborate with data science and oncology partners to translate business needs into high-quality data products, develop AI-ready data systems, and create new data models using cloud-based technologies like AWS S3. Key skills include proficiency in Python, R, SQL, and experience with unstructured databases such as NoSQL and graph databases. The ideal candidate has a strong background in data engineering, healthcare industry knowledge, and the ability to manage multiple projects while adhering to software development best practices.

Skills

Python R SQL AWS Redshift FSx Glue Lambda NoSQL Graph CDISC HL7 FHIR SNOMED_CT OMOP DICOM MLOps DevOps Code_Versioning CI/CD

What you'll do

  • Design and maintain data pipelines for Oncology R&D from diverse sources.
  • Develop high-quality data products by translating business requirements into technical solutions.
  • Create AI-ready data systems aligned with Oncology R&D needs using cloud technologies.
  • Implement standard enterprise-level data models to build new data repositories.
  • Optimize data flows for structured and unstructured data using Python, AWS services.
  • Ensure data quality and performance through KPIs and compliance measures.

What we're looking for

  • Advanced degree in Computer Science, Engineering, Life Sciences, or related field.
  • 3+ years of data engineering experience with expertise in data modeling and database design.
  • Proficiency in Python, R, SQL, and cloud architecture tools like AWS services.
  • Experience with unstructured databases (NoSQL) and other database types (Graph).
  • Strong analytical skills and proven ability to lead improvement initiatives across disciplines.
  • Demonstrated capability in stakeholder management, requirements gathering, and project planning.

Market check

Salary context

Competitive pay

How this pay compares to similar roles

Similar $166k
This role $159k
$97k most similar roles pay here $233k

This role pays less than 63% of similar roles. Most pay $135,000–$198,000 — the shaded band above. At the midpoint, this role pays about $159k versus about $166k for comparable roles.

Based on 240 similar postings.

Employer

About Johnson & Johnson

Johnson & Johnson is a multinational corporation operating in three main segments: consumer health products, pharmaceuticals, and medical devices, known for brands like Tylenol, Band-Aid, and Janssen. Industry: Pharmaceuticals & Medical Devices

Johnson & Johnson currently has 74 open roles on FindRole.

Listed pay typically runs $122,000–$212,750 across 74 roles with salary data.

Most-posted roles

View all roles at Johnson & Johnson

More like this

Similar roles

Manager, Data Science - Oncology

Johnson & Johnson

Spring House, PA 92 days ago $117,000$201,250
Python R SQL AWS Redshift FSx Glue Lambda NoSQL Graph CDISC HL7 FHIR SNOMED_CT OMOP DICOM MLOps DevOps Code_Versioning CI/CD

Director Oncology Data Science LMW

Novartis

Cambridge, MA 84 days ago $194,600$361,400
Python R MachineLearning DataScience CloudComputing AWS BigData DataGovernance StatisticalAnalysis TimeSeriesAnalysis RandomForestAlgorithm

Director, Precision Medicine Data Science & AI

Novartis

East Hanover 20 days ago $194,600$361,400
Python TensorFlow PyTorch AWS Azure GCP Spark NLP Deep Learning LLMs ICD-10 SNOMED CT LOINC RxNorm CPT CI/CD FDA CDS HIPAA EHR Genomic Data Patient Registries
Hybrid

Expert Data Scientist, Early Clinical Development

Genentech

South San Francisco, CA 19 days ago $185,200$343,900
Python R SAS Tableau Spotfire R/Shiny CI/CD Kubernetes AWS Azure GCP Docker Terraform Git Jenkins Prometheus Grafana SDTM ADaM NLP LLMs Chatbots
Hybrid

Chief Data Scientist

Mondelēz International

East Hanover, New Jersey 62 days ago $174,100$287,265
MySQL MongoDB BigQuery Neo4j GitHub Mercurial SVN Jupyter Zeppelin Angular Vue React D3.js Shiny Plotly Dash Node.js Flask Django Google Cloud Microsoft Azure AWS Apache Spark Presto Impala CI/CD

Data Scientist Lead

USAA

US 6 days ago $164,780$314,960
Python R SQL HQL NoSQL JSON XML Linear Regression Logistic Regression Support Vector Machines Decision Trees Clustering Algorithms Project Management Model Risk Management Data Engineering Machine Learning AI/ML Generative AI Cloud Services CI/CD
Hybrid