Senior MLOps Engineer, GenAI Framework

Nvidia

Quick summary

Work type
On-site
Location
Santa Clara, CA
Salary
$152,000–$241,500 / yr
Posted
105 days ago

Market check

Salary context

Competitive pay

How this pay compares to similar roles

Similar $201k
This role $197k
$141k most similar roles pay here $255k

This role pays less than 55% of similar roles. Most pay $165,402–$237,350 — the shaded band above. At the midpoint, this role pays about $197k versus about $201k for comparable roles.

Based on 240 similar postings.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads. Industry: Semiconductors & AI Computing

Nvidia currently has 855 open roles on FindRole.

Listed pay typically runs $184,000–$287,500 across 843 roles with salary data.

Most-posted roles

View all roles at Nvidia

At a glance

TL;DR · Senior MLOps Engineer, GenAI Framework

NVIDIA seeks a Build and CI/CD Engineer to join its GenAI Frameworks team responsible for Megatron-LM and NeMo Framework, open-source platforms designed for Large Language Models and Multimodal applications. The role involves developing and maintaining continuous integration pipelines and release processes, implementing scalable DevOps solutions using Kubernetes, Docker, Slurm, Ansible, GitLab, GitHub Actions, Jenkins, Artifactory, and Jira in hybrid on-premise and cloud environments. Key responsibilities include cluster operations, automating tasks for research and development cycles, and collaborating with CUDA, cuDNN, PyTorch teams to ensure high-quality software delivery. Ideal candidates hold a BS or MS degree in Computer Science or related fields, with 3+ years of industry experience in DevOps and infrastructure engineering, strong system-level programming skills, and expertise in Linux administration, containerization, cluster management, build tools, source code management, and GPU-accelerated systems at scale.

What you'll do

  • Develop and maintain CI/CD pipelines for Megatron-LM and NeMo Framework.
  • Implement scalable DevOps solutions to enable frequent software releases.
  • Work with Kubernetes, Docker, GitLab, and other industry-standard tools.
  • Manage servers, team accounts, and clusters in hybrid environments.
  • Automate tasks to detect accuracy and performance regressions.
  • Develop quality control measures including code analysis and regression testing.

What we're looking for

  • BS or MS degree in Computer Science or related field with 3+ years of DevOps experience.
  • Strong system-level programming skills in Python and shell scripting.
  • Experience with CI/CD tools like GitLab, GitHub Actions, and Jenkins.
  • Proficiency in Linux system administration and containerization technologies (Docker, Kubernetes).
  • Expertise in cluster management and cloud compute technologies (SLURM, k8s).
  • Background in source code management solutions such as GitLab, GitHub.
  • Proven experience with GPU-accelerated systems at scale.

More like this

Similar roles

Senior GenAI Engineer - Solutions Architecture

Citi

Remote (Irving, TX) 9 days ago $125,760$188,640
Python Java C++ Docker Kubernetes LLMOps RAG Context Engineering Vector Database Knowledge Graphs Prompt Injection Defense Dynamic Data Masking CI/CD Logging Explainability Evaluation Pipelines Terraform AWS GCP
Remote

Senior GenAI Platform Engineer - VP

Citi

Remote (388 Greenwich Street - Trading, US) 38 days ago $142,320$213,480
Python FastAPI Flask Pandas Scikit-learn Hugging Face Node.js Express NestJS TypeScript CI/CD Docker Kubernetes AWS Azure Google Cloud Platform PostgreSQL MongoDB Redis Git Jenkins GitHub Bitbucket Terraform Ansible Prometheus Grafana
Remote

Senior GenAI Platform Engineer - VP

Citi

Remote (388 Greenwich Street - Tower, US) 38 days ago $142,320$213,480
Python FastAPI Flask Pandas Scikit-learn Hugging Face Node.js Express NestJS TypeScript CI/CD Docker Kubernetes AWS Azure Google Cloud Platform PostgreSQL MongoDB Redis Git Jenkins GitHub Bitbucket Terraform Ansible Prometheus Grafana
Remote

Senior GenAI Platform Engineer - VP

Citi

Remote (388 Greenwich Street - Trading, US) 38 days ago $142,320$213,480
Python FastAPI Flask Pandas Scikit-learn Hugging Face Node.js Express NestJS TypeScript CI/CD Docker Kubernetes AWS Azure Google Cloud Platform PostgreSQL MongoDB Redis Git Jenkins GitHub Bitbucket Terraform Ansible Prometheus Grafana
Remote

MLOps Engineer, Mid

Booz Allen Hamilton

Chantilly, VA 6 days ago $77,600$176,000
Python AWS Kubernetes Terraform Helm Git Docker CI/CD MLOps SageMaker Lambda API Gateway DynamoDB S3 IAM Prometheus Grafana

Senior Machine Learning Engineer, MLOps West Coast

Autodesk

San Francisco, CA 56 days ago $131,400$235,950
CI/CD Kubernetes Docker Python REST API AWS Azure Grafana Prometheus Git Scrum MLOps LLM API security Rate limiting Authentication Authorization Agile
Hybrid