AI Compute Engineer - NVIS

Nvidia

Remote

Quick summary

Work type
Remote
Location
Santa Clara, CA
Salary
$92,000–$155,250 / yr
Posted
3 days ago

Market check

Salary context

Below market

How this pay compares to similar roles

Similar $195k
This role $124k
$75k most similar roles pay here $254k

This role pays less than 96% of similar roles. Most pay $154,400–$235,750 — the shaded band above. At the midpoint, this role pays about $124k versus about $195k for comparable roles.

Based on 240 similar postings.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads. Industry: Semiconductors & AI Computing

Nvidia currently has 997 open roles on FindRole.

Listed pay typically runs $168,000–$270,250 across 984 roles with salary data.

Most-posted roles

View all roles at Nvidia

At a glance

TL;DR · AI Compute Engineer - NVIS

NVIDIA seeks an AI Compute Engineer to join its Infrastructure Specialists team, where you will work on large-scale AI Compute projects by deploying and managing AI Compute/HPC infrastructure in Linux-based environments. This role involves collaborating with customers, partners, and internal teams to analyze, define, and implement complex systems, ensuring thorough documentation and knowledge transfer for successful customer rollouts. You will leverage your expertise in Linux system administration, cluster management tools like SLURM and Kubernetes, and scripting languages such as Bash and Python to deploy and validate AI infrastructure. Ideal candidates have 4+ years of experience in providing support and deployment services, with a strong background in networking, system design, and automation. Experience with GPU hardware/software, InfiniBand, MPI, and storage technologies like Lustre is highly valued for this dynamic customer-focused role.

What you'll do

  • Deploy and manage AI Compute/HPC infrastructure in Linux-based environments.
  • Serve as the domain expert during customer planning calls and implementation phases.
  • Create handover documentation for customers to support system rollouts.
  • Provide feedback to internal teams by documenting issues and suggesting improvements.
  • Utilize cluster management tools like SLURM, Kubernetes, or LSF for provisioning.

What we're looking for

  • 4+ years of experience in deploying and managing AI Compute/HPC infrastructure.
  • Proficiency in Linux system administration and cluster management tools like SLURM or Kubernetes.
  • Strong scripting skills with Bash, Python, Ansible, and other automation tools.
  • Excellent interpersonal and customer support skills for resolving issues effectively.
  • Experience with benchmarking tools such as HPL, NCCL tests, MLPerf.
  • Degree in Computer Science, Electrical Engineering, or related field.

More like this

Similar roles

Senior AI Compute Engineer - NVIS

Nvidia

Remote (Santa Clara, CA) 66 days ago $148,000$235,750
Linux Bash Python Ansible SLURM LSF UGE Kubernetes HPL NCCL MLPerf InfiniBand MPI Lustre GPFS BCM Terraform CI/CD
Remote

AI Benchmarking and Telemetry Engineer - NVIS

Nvidia

Remote (Santa Clara, CA) 119 days ago $184,000$287,500
Prometheus Grafana NVIDIA DCGM Linux system administration Python Bash CUDA Kubernetes Docker Slurm InfiniBand RoCE Lustre GPFS BeeGFS Weka VAST HPL HPCG MLPerf NCCL
Remote

AI Cloud Engineer

Genworth Financial

Raleigh, NC 86 days ago
AWS Terraform CI/CD Python Docker GitHub Actions PostgreSQL RDS Aurora S3 Lambda API Gateway CloudWatch IAM VPC WAF ECR ECS Glue Athena EMR DynamoDB Bash TDD BDD
Hybrid

AI Systems Hardware Engineer

Qualcomm

San Diego, CA +1 13 days ago $164,000$246,000
Python Docker Kubernetes Terraform AWS CI/CD PostgreSQL Prometheus Grafana SQL C++ Java Linux Git RESTful_APIs JSON YAML Scalability Performance_Tuning

AI Enablement Engineer

Electronic Arts

Vancouver, British Columbia, Canada +2 14 days ago $122,300$170,700
Python AWS C# JavaScript HTML CSS Cursor GitHub Copilot Claude Code Prompt and context engineering AI agents MCP AI assistants RAG Vector databases Model tuning CI/CD Observability Infrastructure as code Unity Unreal
Hybrid

Applied AI Engineer

Ramp

Remote (New York City, New York, US) 157 days ago $155,000$339,500
Python JavaScript Node.js Django Flask React PostgreSQL MongoDB AWS GCP Kubernetes Terraform CI/CD GitOps
Remote