Senior Software Engineer, At Scale Compute Analysis

Nvidia

Remote

Quick summary

Work type
Remote
Location
Santa Clara, CA
Salary
$152,000–$241,500 / yr
Posted
5 days ago

Market check

Salary context

Competitive pay

How this pay compares to similar roles

Similar $180k
This role $197k
$126k most similar roles pay here $254k

This role pays more than 64% of similar roles. Most pay $142,400–$217,725 — the shaded band above. At the midpoint, this role pays about $197k versus about $180k for comparable roles.

Based on 240 similar postings.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads. Industry: Semiconductors & AI Computing

Nvidia currently has 985 open roles on FindRole.

Listed pay typically runs $184,000–$287,500 across 971 roles with salary data.

Most-posted roles

View all roles at Nvidia

At a glance

TL;DR · Senior Software Engineer, At Scale Compute Analysis

As a senior data analyst on NVIDIA’s GPU-accelerated clusters team, you will analyze large-scale datacenter workloads to identify application and platform improvement opportunities. You’ll collaborate with OS, container, GPU, and systems engineers to apply machine learning techniques for categorization and forecasting, integrating these insights into practical tools used by the team. Your daily tasks include working with high-dimensional datasets to spot trends, correlate changes with known events, summarize findings, and communicate results effectively to both technical teams and leadership. You must have 5+ years of experience in complex dataset analysis, strong Python and JavaScript skills, proficiency with telemetry stacks like Grafana and Elasticsearch, and a solid understanding of machine learning concepts. Experience in Linux environments, high-performance computing, and visualizing high-dimensional problems is also essential for this role.

What you'll do

  • Analyze large-scale datacenter workloads to identify improvement opportunities for applications and platforms.
  • Use machine learning techniques to categorize and forecast workload trends from high-dimensional datasets.
  • Develop practical visualizations and lightweight implementations of ML/DL models within existing software workflows.
  • Collaborate with OS, container, GPU, and systems engineers to clarify analysis questions and document methods.
  • Communicate clear findings and actionable insights from complex data analyses to engineering teams and leadership.

What we're looking for

  • 5+ years of experience analyzing complex datasets and debugging data issues.
  • BS or MS in Engineering, Mathematics, Physics, Computer Science, or equivalent.
  • Strong proficiency in Python and JavaScript for data analysis tasks.
  • Experience with telemetry/observability tools like Grafana, Elasticsearch, Splunk.
  • Hands-on application of machine learning concepts and quick learning ability.
  • Diligent and action-biased approach to conducting thorough analyses.

More like this

Similar roles

Senior ML Infrastructure Engineer (Compute)

General Motors (GM)

Remote (Gm Automation - Sunnyvale - Gm Automation - Sunnyvale, US) 11 days ago
Go AWS GCP Azure Docker Kubernetes CI/CD Prometheus Grafana PostgreSQL Redis HPC GPU Telemetry Python C++ REST GraphQL GitLab
Remote

Senior Software Engineer, Big Data

Zillow

Remote (Remote-Wa, US) 82 days ago $160,900$257,100
Kafka Flink Python Java Scala AWS Kubernetes Terraform CI/CD SLOs Prometheus Grafana Databricks Confluent MSK Redpanda
Remote

Software Engineer, High Performance Computing

SpaceX

Hawthorne, CA 4 days ago $125,000$145,000
C++ Linux TCP UDP CI/CD Python ARM PowerPC x86 Microcontrollers Networking Satellite Constellation Performance Security Reliability Autonomous Systems Computer Architecture Networks High Performance Computing

Principal Software Engineer - Compute Infrastructure

Nvidia

Remote (Santa Clara, CA) 26 days ago $248,000$391,000
Kubernetes OpenShift Terraform Go Python GitOps ArgoCD AWS GCP NFSv4 NVMe/TCP Hyperconverged storage CI/CD Microservices Self-service architecture SLAs
Remote