Senior CPU Performance Architect

Nvidia

Actively hiring
Santa Clara, CA, US Posted 50 days ago $224,000–$356,500 / year

At a glance

AI generated

TL;DR

Join the CPU performance architecture team as a senior specialist driving CPU technology for applications including AI/ML, HPC, CSP, gaming, VR, and autonomous vehicles. Day-to-day work involves workload bring-up and performance analysis on silicon and full-system simulators, studying real-world use cases to identify critical application behavior, debugging performance bottlenecks in multi-core and multi-socket systems, and working with architects to improve future CPU designs based on your findings. Ideal candidates have a BS/MS in Electrical Engineering, Computer Science, or equivalent experience, along with 12+ years of relevant industry experience, deep knowledge of CPU microarchitecture, and proficiency in performance test development and benchmarking. Familiarity with the ARM ISA, GPU driver experience, and expertise in AI frameworks such as PyTorch are advantageous. NVIDIA continues to innovate in the server CPU market with CPUs that integrate with its GPUs and SoCs.

Skills

Python, C++, ARM, PyTorch, NVIDIA, GPU, HPC, AI, DL, CI/CD, Linux, Performance Benchmarking, CPU Microarchitecture, System Architecture, Simulator, Multi-Core Systems, Interconnect Architecture, Performance Optimization, Benchmarking, ISA

What you'll do

  • Conduct workload bring-up and performance analysis on silicon and full-system simulators.
  • Study real-world use cases to identify critical application behavior and create test cases.
  • Analyze and debug performance scaling bottlenecks in multi-core and multi-socket CPU systems.
  • Work with architects to improve future CPU designs based on performance findings.
  • Benchmark NVIDIA’s CPUs against competitors and recommend software or hardware improvements.

What we're looking for

  • BS/MS in Electrical Engineering, Computer Science, or equivalent experience.
  • 12+ years of relevant experience in CPU performance analysis and workload study.
  • Deep knowledge of CPU microarchitecture and system architecture.
  • Experience with ARM instruction set architecture (ISA).
  • Proficiency in benchmarking and test development for CPU and I/O performance.

Market check

Salary context

This $224,000–$356,500 range sits above 97% of similar postings on FindRole.

Peer median band

$160,154–$241,500

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$167,900–$235,750

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads.

Industry: Semiconductors & AI Computing

Nvidia currently has 802 open roles on FindRole.

Listed pay typically runs $184,000–$287,500 across 798 roles with salary data.

More like this

Similar roles

Senior CPU Workloads and Simulation Architect

Nvidia

Santa Clara, CA, US 30 days ago $224,000–$356,500
C/C++ Python ARM SimPoint PyTorch TensorFlow Sampling methodology Data science User-mode and kernel-mode drivers Functional and performance simulators CPU/GPU application development NVIDIA Grace CPU Superchip Vera CPU

Principal CPU Power Architect

Nvidia

Santa Clara, CA, US 13 days ago $272,000–$431,250
RTL VLSI CPU Architecture Power Consumption Process Technologies Circuit Design Low Power Techniques Physical Design GPU Design SOC Design Advanced Packaging Technologies AI HPC Cloud Deployments CI/CD

Senior Solutions Architect, Datacenter CPUs

Nvidia

Santa Clara, CA, US 45 days ago $184,000–$287,500
Arm Linux C C++ Python SPEC CPU MLPerf AWS Azure GCP Kubernetes Docker CI/CD Prometheus Grafana Terraform

Senior Developer Technology Engineer, CPU Performance

Nvidia

Remote (Santa Clara, CA, US) 44 days ago $152,000–$241,500
C/C++ CPU architecture ARM x86 memory subsystem cache DRAM storage parallel programming vectorization concurrency distributed database systems Spark compression storage systems networking distributed computer architectures GPU architectures CI/CD

Senior Accelerated Computing Architect

Nvidia

Santa Clara, CA, US 21 days ago $184,000–$287,500
CUDA C++ C MPI OpenSHMEM Python Linux GPU CPU Benchmarking Profiling IPC_APIs OpenCL NVSHMEM

Senior HPC Solutions Architect

Nvidia

Remote (Santa Clara, CA, US) 48 days ago $184,000–$287,500
Python C++ CUDA SLURM Linux BMC PCIe Network_Adapters InfiniBand DPU RoCE ARM Linux_Kernel Drivers SDN C