Principal Data Scientist - Immunology - (2 positions)

Johnson & Johnson

Cambridge, MA · Raritan, NJ · San Diego, CA · Spring House, PA · Titusville, NJ · Posted 11 days ago · $117,000–$201,250 / year

At a glance

AI generated

TL;DR

Johnson & Johnson Innovative Medicine is hiring a Principal Data Scientist specializing in Knowledge Graph Engineering for their Immunology R&D Data Science & Digital Health team. This hands-on role involves designing and implementing scalable knowledge graph infrastructure to standardize and connect biomedical and clinical data across the product lifecycle, ensuring interoperability and supporting analytics, search, and AI applications. Key responsibilities include developing graph-based models, curating ontologies using RDF standards, integrating SPARQL/GraphQL services, and collaborating with cross-functional teams to enable advanced features for predictive modeling and terminology services. The ideal candidate has a Ph.D. or master's degree in bioinformatics or related fields, 5+ years of experience in health informatics, expertise in semantic web technologies like SPARQL and RDF, proficiency in graph databases such as Neo4j, and familiarity with CI/CD stacks and DevOps tools. This role requires strong stakeholder management skills and the ability to manage multiple projects simultaneously while ensuring high availability and scalability of the knowledge graph infrastructure.

Skills

SPARQL, GraphQL, REST, Neo4j, Amazon Neptune, RDF, OWL, Python, CI/CD, GitLab, Docker, SQL, NoSQL, Kubernetes, Jenkins, Azure DevOps, Prometheus, Grafana, NLP, Ontology Development, Knowledge Graph Infrastructure

What you'll do

  • Design and implement a scalable knowledge graph infrastructure for Immunology R&D data.
  • Apply graph-based data modeling to organize, integrate, and retrieve Immunology R&D data efficiently.
  • Curate ontologies using RDF standards to map into established biomedical ontologies and controlled terminologies.
  • Develop ingestion pipelines to normalize and map concepts across diverse data sources.
  • Extend immunology-relevant ontologies and maintain synonyms, cross-references, and provenance.
  • Draft documentation such as data dictionaries and flow diagrams for the knowledge graph infrastructure.

What we're looking for

  • 5+ years of professional experience in health informatics and semantic technologies for biomedical applications.
  • Expertise in large-scale knowledge graph construction, ontology development, and data integration in pharmaceutical or healthcare domains.
  • Proficiency in programming with parser combinators, natural language processing, and linked data (RDF triple stores and property graphs).
  • Strong skills in semantic web technologies including SPARQL, RDF, OWL, and familiarity with Neo4j and Amazon Neptune graph databases.
  • Experience managing complex biomedical datasets such as clinical, genomics, and proteomics data.
  • Proficiency in various data storage solutions and data modeling techniques (SQL, key-value, column, document, graph stores).
  • Demonstrated ability to manage multiple projects simultaneously, prioritize work, and deliver maximum business value.

Market check

Salary context

This $117,000–$201,250 range sits above 35% of similar postings on FindRole.

Peer median band

$110,000–$220,000

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$135,000–$197,820

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Johnson & Johnson

Johnson & Johnson is a multinational corporation operating in three main segments: consumer health products, pharmaceuticals, and medical devices, known for brands like Tylenol, Band-Aid, and Janssen.

Industry: Pharmaceuticals & Medical Devices

Johnson & Johnson currently has 46 open roles on FindRole.

Listed pay typically runs $122,000–$211,025 across 45 roles with salary data.

More like this

Similar roles

Associate Director, Commercial Data Science - Immunology

Regeneron

Sleepy Hollow, US · 31 days ago · $157,200–$256,600
SQL, Python, PySpark, R, Databricks, Snowflake, Marketing Mix Modeling, MMA, HCP Targeting, Attribution Modeling, Next Best Action, AI, Machine Learning, CI/CD, Cloud Analytics, Omnichannel Data, Clean Room Data, LiveRamp, Crossix, IQVIA APLD, Claims Data, Regulatory Compliance

Principal Data Scientist

Microsoft

US · 70 days ago
Python, R, T-SQL, KQL, Apache Spark, CI/CD, Docker, Delta Lake, MLflow, Azure Machine Learning, REST API, SQL, PostgreSQL, MLOps, Power BI, Tableau, Git, Jupyter Notebook, GitHub, Swagger

Principal Data Scientist

Zillow

Remote (US) · 35 days ago · $178,300–$284,700
Python, R, SQL, Kubernetes, Terraform, AWS, CI/CD, Docker, Prometheus, Grafana

Principal Data Scientist

Thermo Fisher

Remote (US) · 31 days ago · $185,000–$215,000
Python, SQL, R, scikit-learn, XGBoost, PyTorch, TensorFlow, Databricks, Spark, Delta Lake, Snowflake, AWS, Azure, MLflow, Git, CI/CD, LLM APIs, survival analysis, causal inference, propensity scoring, longitudinal modeling

Principal Data Scientist

Capital One Financial

McLean, VA, US · 52 days ago · $161,800–$184,600
Python, R, SQL, Pandas, Scikit-learn, TensorFlow, Keras, Spark, AWS, Azure, Google Cloud Platform, Docker, Kubernetes, CI/CD, Git, Jupyter Notebook, GitHub, PostgreSQL, MongoDB