Principal Data Scientist - Oncology

Johnson & Johnson

Quick summary

Work type
On-site
Location
Spring House, PA · Cambridge, MA · Raritan, NJ · Titusville, NJ · San Diego, CA
Salary
$117,000–$201,250 / yr
Posted
9 days ago

Market check

Salary context

Competitive pay

How this pay compares to similar roles

Similar $163k
This role $159k
$97k most similar roles pay here $233k

This role pays less than 61% of similar roles. Most pay $130,375–$195,442 — the shaded band above. At the midpoint, this role pays about $159k versus about $163k for comparable roles.

Based on 240 similar postings.

Employer

About Johnson & Johnson

Johnson & Johnson is a multinational corporation operating in three main segments: consumer health products, pharmaceuticals, and medical devices, known for brands like Tylenol, Band-Aid, and Janssen. Industry: Pharmaceuticals & Medical Devices

Johnson & Johnson currently has 71 open roles on FindRole.

Listed pay typically runs $117,000–$201,250 across 68 roles with salary data.

Most-posted roles

View all roles at Johnson & Johnson

At a glance

TL;DR · Principal Data Scientist - Oncology

The Principal Data Scientist - Oncology role at Johnson & Johnson Innovative Medicine involves joining the Data Science and Digital Health team to standardize and connect biomedical and clinical data, focusing on oncology research and development. This hands-on technical position requires expertise in semantic technologies, ontology, and graph data modeling, with a strong background in life sciences. Day-to-day responsibilities include designing and implementing scalable knowledge graph infrastructure for data interoperability, curating ontologies using RDF standards, developing ingestion pipelines, and collaborating with cross-functional teams to enable advanced analytics and AI applications. The ideal candidate holds a Ph.D. or Master's degree in relevant fields and has extensive experience in large-scale knowledge graphs, semantic web technologies like SPARQL and RDF, and graph databases such as Neo4j. Proficiency in various data storage solutions and DevOps tools is also essential for managing high-availability and scalable database infrastructure tailored to oncology R&D needs.

What you'll do

  • Design and implement a scalable knowledge graph infrastructure for data standardization in Oncology R&D.
  • Apply graph-based data modeling to organize, integrate, and retrieve Oncology R&D data efficiently.
  • Curate and extend ontologies using RDF standards to ensure clear mapping into established biomedical ontologies.
  • Develop ingestion and curation pipelines to normalize and map concepts across diverse data sources.
  • Draft documentation such as data dictionaries and flow diagrams to facilitate understanding of the knowledge graph.
  • Partner with cross-functional teams to enable NLP/RAG over graphs for predictive modeling and terminology services.

What we're looking for

  • Ph.D. or Master's degree in bioengineering, computer science, IT, bioinformatics, physics, mathematics, or related fields.
  • 5+ years of professional experience in health informatics and semantic technologies for biomedical applications.
  • Experience in large-scale knowledge graph construction and ontology development in pharmaceutical or healthcare domains.
  • Proficiency in semantic web technologies (SPARQL, RDF, OWL) and familiarity with graph databases (Neo4j, Amazon Neptune).
  • Programming background in parser combinators, natural language processing, and linked data.
  • Demonstrated work with complex biomedical datasets including clinical, genomics, and proteomics data.

More like this

Similar roles

Principal Data Scientist, R&D Oncology

Johnson & Johnson

Spring House, PA 95 days ago $117,000$201,250
Python R SQL AWS Redshift FSx Glue Lambda NoSQL Graph CDISC HL7 FHIR SNOMED_CT OMOP DICOM MLOps DevOps Code_Versioning CI/CD

Manager, Data Science - Oncology

Johnson & Johnson

Spring House, PA 95 days ago $117,000$201,250
Python R SQL AWS Redshift FSx Glue Lambda NoSQL Graph CDISC HL7 FHIR SNOMED_CT OMOP DICOM MLOps DevOps Code_Versioning CI/CD

Director Oncology Data Science LMW

Novartis

Cambridge, MA 87 days ago $194,600$361,400
Python R MachineLearning DataScience CloudComputing AWS BigData DataGovernance StatisticalAnalysis TimeSeriesAnalysis RandomForestAlgorithm

Principal Data Scientist

Zillow

Remote (Remote-Usa, US) 43 days ago $178,300$284,700
Python R SQL Kubernetes Terraform AWS CI/CD Docker Prometheus Grafana
Remote