Principal Data Scientist - AI Foundations, Specialist Models

Capital One Financial

Quick summary

Work type
On-site
Location
McLean, VA · New York, NY · San Jose, CA
Salary
$161,800–$184,600 / yr
Posted
22 days ago

Market check

Salary context

Competitive pay

How this pay compares to similar roles

Similar $197k
This role $173k
$136k most similar roles pay here $248k

This role pays less than 60% of similar roles. Most pay $157,050–$236,900 — the shaded band above. At the midpoint, this role pays about $173k versus about $197k for comparable roles.

Based on 240 similar postings.

Employer

About Capital One Financial

Capital One Financial is a bank holding company specializing in credit cards, auto loans, banking, and savings products, known for its data-driven approach to consumer and commercial finance. Industry: Financial Services & Banking

Capital One Financial currently has 498 open roles on FindRole.

Listed pay typically runs $197,300–$225,100 across 495 roles with salary data.

Most-posted roles

View all roles at Capital One Financial

At a glance

TL;DR · Principal Data Scientist - AI Foundations, Specialist Models

As a Principal Data Scientist on the Entity Resolution Systems team, you will lead the development of advanced machine learning solutions to enhance entity resolution across various business units. Your daily tasks include collaborating with data scientists, software engineers, and product managers to create cutting-edge entity resolution systems using Python, AWS, Spark, transformers, graph ML, and other modern technologies. You will work on large datasets to uncover valuable insights that drive personalized experiences for millions of customers. Ideal candidates are innovative, creative, technically proficient, and adept at handling big data challenges. This role demands expertise in leveraging agentic AI tools and workflows to build scalable solutions that address real-world business problems, making a significant impact within the enterprise.

What you'll do

  • Lead the development of cutting-edge machine learning solutions for entity resolution.
  • Apply deep learning and transformer architectures to analyze large datasets.
  • Work with Python, AWS, Spark, and graph ML technologies to extract insights.
  • Collaborate on building a modern, scalable entity resolution stack.
  • Experiment with Agentic AI tools to enhance testing and innovation.

What we're looking for

  • Extensive experience in building machine learning solutions for entity resolution.
  • Proficient in using Python, AWS, Spark, and modern ML models like transformers.
  • Ability to work with large datasets and extract meaningful insights from numeric and textual data.
  • Experience in leveraging agentic AI tools and workflows for development and testing.
  • Strong background in statistical methods and big data analytics.
  • Collaborative skills to partner with cross-functional teams including product managers and engineers.

More like this

Similar roles

Lead Data Scientist - Document AI

CVS Health

Remote (New York-161 Ave Of The Americas, US) 10 days ago $142,140$284,280
Python SQL Machine Learning Statistical Analysis Predictive Modeling Data Lineage Traceability Explainability CI/CD Healthcare Industry Knowledge Large Data Set Analysis Multiple Data Sources MLOps
Remote

Principal Data Scientist

Zillow

Remote (Remote-Usa, US) 43 days ago $178,300$284,700
Python R SQL Kubernetes Terraform AWS CI/CD Docker Prometheus Grafana
Remote

Principal Data Scientist

Northrop Grumman

San Diego, CA 3 days ago $125,300$187,900
Python SQL Tableau .DAT file decoding CI/CD PowerBI Plotly Kubernetes Docker PostgreSQL AWS RStudio Git Jupyter Notebook Scikit-learn TensorFlow Pandas NumPy Matplotlib

Senior Platform & AI Engineer

Adobe

San Jose 9 days ago $177,900$257,550
AWS Python Apache Airflow DynamoDB MySQL SageMaker OpenSearch Pinecone LangChain LangGraph MLFlow CI/CD Docker Kubernetes PySpark

Machine Learning - Data Scientist Lead

Apple Inc

Sunnyvale, CA 36 days ago $181,100$318,400
Python NumPy pandas scikit-learn PyTorch TensorFlow LLMs multimodal_models BLEU ROUGE FID OpenEval ELO-based_ranking CI/CD