Senior Scientist, Synthetic Data and Privacy

Nvidia

Remote

Quick summary

Work type: Remote
Location: Santa Clara, CA
Salary: $168,000–$264,500 / yr
Posted: 5 days ago

Market check

Salary context

Above market

How this pay compares to similar roles

Chart: pay scale from $79k to $284k, with a shaded band where most similar roles pay. Similar roles: $156k. This role: $216k.

This role pays more than 89% of similar roles, most of which pay $126,800–$184,912. At the midpoint, this role pays about $216k versus about $156k for comparable roles.

Based on 240 similar postings.
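
The midpoint figures quoted above can be reproduced from the listed ranges. This is a minimal sketch that assumes "midpoint" means the simple average of the low and high ends of a range; the page does not define the term, and the $156k figure for similar roles may come from a different statistic that happens to match. The function name `midpoint` is illustrative only.

```python
def midpoint(low: float, high: float) -> float:
    """Simple average of the two ends of a salary range (assumed definition)."""
    return (low + high) / 2


# Ranges copied from the listing text.
this_role = midpoint(168_000, 264_500)
similar_roles = midpoint(126_800, 184_912)

print(f"This role:     ${this_role:,.0f}")
print(f"Similar roles: ${similar_roles:,.0f}")
print(f"Difference:    ${this_role - similar_roles:,.0f}")
```

Rounded to the nearest thousand, the two values agree with the $216k and $156k shown in the comparison.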

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads. Industry: Semiconductors & AI Computing

Nvidia currently has 994 open roles on FindRole.

Listed pay typically runs $168,000–$270,250 across 977 roles with salary data.

At a glance

TL;DR · Senior Scientist, Synthetic Data and Privacy

As a Senior Scientist at NVIDIA, you will join a research team advancing large language models (LLMs) through synthetic data generation and privacy-preserving techniques. Day to day, you will build LLM-based methods for generating high-quality synthetic data with context-aware anonymization, optimize task-specific LLMs for efficient inference, and maintain open-source libraries within the NVIDIA NeMo ecosystem. You will also mentor junior researchers and publish research at top AI conferences. The role requires expertise in machine learning, NLP, and privacy-enhancing technologies, along with experience developing software libraries used by a broad community. Proficiency in LLM inference optimization and data processing pipelines, and knowledge of global privacy regulations such as GDPR and CCPA, are essential.

What you'll do

  • Build LLM-based methods for synthetic data generation and context-aware anonymization.
  • Optimize task-specific LLMs for low-latency, high-throughput inference.
  • Design and maintain open-source libraries with clean APIs and documentation.
  • Publish original research at top machine learning conferences.
  • Mentor interns and junior researchers to foster technical growth.

What we're looking for

  • PhD in Computer Science, Machine Learning, Statistics, or a related field with 2+ years of applied research experience.
  • Proven expertise in developing software libraries used by a broad developer community.
  • Strong publication record at top machine learning and AI conferences.
  • Deep understanding of large language models (LLMs) and inference optimization techniques.
  • Experience in synthetic data generation, anonymization, and PII detection.
  • Active contributions to open-source projects in ML, security, or privacy domains.
  • Knowledge of global privacy regulations such as GDPR and CCPA.

More like this

Similar roles

Senior Scientist, Synthetic Data Generation

Nvidia

Remote (Santa Clara, CA) +1 · 5 days ago · $168,000–$264,500
Python, LLM-based methods, Git, CI/CD, vLLM, TGI, NeMo, OpenAPI, Docker, Kubernetes, AWS, GCP, Azure, PostgreSQL, MongoDB, TensorFlow, PyTorch, Hugging Face Transformers, Apache Airflow

Lead Data Privacy Engineer

CVS Health

Remote (Work At Home-California, US) · 42 days ago · $106,605–$284,280
Python, Java, Go, Rust, CI/CD, AWS, Azure, GCP, SQL, NoSQL, Docker, Kubernetes, Terraform, Prometheus, Grafana, GDPR, CCPA, HIPAA, NIST, FIPS 140-2, ISO, HITRUST, PCI, CPRA, DLP, CASB, Data Encryption, Tokenization, Key Management

Senior, Data Scientist

Walmart

Seattle, WA +1 · 62 days ago · $108,000–$216,000
Python, SQL, Java, machine learning, data visualization, feature selection, model tuning, scalable data storage, data ecosystems, data quality standards, CI/CD

Senior, Data Scientist

Walmart

Seattle, WA · 59 days ago · $108,000–$216,000
Python, SQL, Java, machine learning, data visualization, data ecosystems, data quality standards, scalable data storage solutions, CI/CD, AWS, Kubernetes

Data Scientist, Senior

Qualcomm

San Diego, CA · 116 days ago
Python, AWS, Azure, GCP, SQL, NoSQL, LLMs, LangChain, Keras, PyTorch, TensorFlow, scikit-learn, APIs, CI/CD, Prometheus, Grafana