Senior Research Scientist, Multi-Modal Language Models

Nvidia

Actively hiring
Santa Clara, CA, US | Posted 113 days ago | $192,000–$304,750 / year

At a glance

AI generated

TL;DR

NVIDIA is hiring a Senior Research Scientist specializing in multi-modal language models to join its Nemotron Multi-modal technology team. The role involves adding new capabilities to the models, improving model generalization through data synthesis and retraining, and developing training recipes for mixed modalities such as text, image, video, and audio. The ideal candidate will also contribute to open-source communities, collaborate with researchers to bring new ideas into production, and explore novel evaluation paradigms. Requirements include a PhD or equivalent experience in computer science or a related field, proficiency in Python and PyTorch, expertise in multi-modal LLMs and large-scale distributed systems for deep learning, and a record of contributions to open-source projects.

Skills

Python, PyTorch, Distributed Systems, Deep Learning, Open Source, CI/CD, Computer Vision, Multi-Modal LLMs, NVIDIA Nemotron Multi-modal Technology, Algorithms, Data Structures, Parallel Computing, Systems Programming

What you'll do

  • Drive innovation by adding new capabilities to multi-modal language models.
  • Enhance model generalization through data synthesis and retraining strategies.
  • Develop training recipes that integrate multiple modalities such as text, images, video, and audio.
  • Design solutions for improved efficiency across various performance metrics.
  • Translate cutting-edge research into practical implementations for production use.

What we're looking for

  • PhD in Computer Science, Electrical Engineering, or a related field, with 4+ years of experience.
  • Expertise in multi-modal language models and computer vision.
  • Proficiency in Python and deep learning frameworks like PyTorch.
  • Strong background in algorithms, data structures, and distributed computing.
  • Proven ability to collaborate across research and engineering teams.
  • Experience developing and scaling large distributed systems for deep learning.

Market check

Salary context

This $192,000–$304,750 range sits above 85% of similar postings on FindRole.

Peer median band

$154,725–$241,500

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$162,000–$237,905

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads.

Industry: Semiconductors & AI Computing

Nvidia currently has 802 open roles on FindRole.

Listed pay typically runs $184,000–$287,500 across 798 roles with salary data.

More like this

Similar roles

Senior Research Scientist, AI-Mediated Reality and Interaction

Nvidia

Santa Clara, CA, US | 24 days ago | $192,000–$304,750
Python, C++, CUDA, PyTorch, Computer Vision, AI Algorithms, 3D Graphics Development, Deep Learning, Neural Rendering, Generative Models, Large Language Models, Human Behavior Understanding, Digital Human Creation, CVPR, ICCV, ECCV, SIGGRAPH, NeurIPS, ICLR

Senior Research Scientist, Efficient Deep Learning

Nvidia

Santa Clara, CA, US | 139 days ago | $192,000–$304,750
Python, PyTorch, C++, CUDA, TensorFlow, Kubernetes, Docker, CI/CD, Git, PostgreSQL, Hadoop, Spark, Jupyter, GitHub, Slack, Zoom, Google Cloud Platform, AWS, Azure, MLOps, Scikit-learn, Pandas, NumPy

Senior Research Scientist, Fundamental LLM Research for Knowledge, Reasoning, and Agents

Nvidia

Santa Clara, CA, US | 139 days ago | $224,000–$356,500
Python, PyTorch, LLM training, alignment, evaluation, deep learning, NLP, data preparation, model parallelization, tensor parallelism, pipeline parallelism, multi-modality research, knowledge acquisition techniques, learning paradigms, self-reflection algorithms, synthetic data generation, reasoning and inference algorithms

Senior Vision Language Model Engineer

Nvidia

Santa Clara, CA, US | 15 days ago | $184,000–$287,500
Python, TensorFlow, PyTorch, Docker, Kubernetes, AWS, CI/CD, PostgreSQL, MongoDB, Git, GitHub, Jupyter, Swagger, RESTful APIs, CVPR, NeurIPS, ICML, ECCV