Principal Software Quality Engineer – GPU & Machine Learning in San Jose, California | Advanced Micro Devices, Inc

Amd

Hybrid

Quick summary

Work type
Hybrid
Location
CA
Salary
$210,400–$210,400 / yr
Posted
26 days ago
Closes
Feb 26, 2027

Market check

Salary context

Competitive pay

How this pay compares to similar roles

Similar $207k
This role $210k
$157k most similar roles pay here $252k

This role pays more than 50% of similar roles. Most pay $177,250–$236,975 — the shaded band above. At the midpoint, this role pays about $210k versus about $207k for comparable roles.

Based on 240 similar postings.

Employer

About Amd

AMD (Advanced Micro Devices) is a semiconductor company that develops high-performance processors, graphics cards, and adaptive computing solutions for gaming, data centers, and embedded markets. Industry: Semiconductors

Amd currently has 63 open roles on FindRole.

Listed pay typically runs $200,000–$200,000 across 63 roles with salary data.

Most-posted roles

View all roles at Amd

At a glance

TL;DR · Principal Software Quality Engineer – GPU & Machine Learning in San Jose, California | Advanced Micro Devices, Inc

As a Principal Software Quality Engineer at AMD, you will lead the technical direction of ROCm software validation across compute workloads and server-class systems, ensuring quality standards for hyperscalers, OEMs, and open-source users. Your daily tasks include defining end-to-end validation architecture, setting release-qualification criteria, leading system-level testing, driving workload validation, architecting test infrastructure, and championing modern quality engineering practices. You will use Python and C++ extensively, with expertise in GPU compute software stacks, deep-learning frameworks, HPC runtimes, Linux kernel/GPU drivers, and distributed systems. Ideal candidates have a strong background in complex system validation, hands-on experience with GitHub at scale, and proficiency in AI-driven workflows for engineering tasks. This role is crucial for AMD's strategic Instinct GPUs, impacting millions of production GPUs worldwide.

What you'll do

  • Own end-to-end validation architecture for ROCm across various GPU generations and server platforms.
  • Define release-qualification criteria and drive the organization to meet them for ROCm software releases.
  • Lead system-level testing for multi-GPU topologies, fabric bring-up, and validation on AMD Instinct™ GPU platforms.
  • Drive compute workload validation and establish reproducible methodology for benchmarks like MLPerf.
  • Architect test infrastructure including distributed runners, CI fleets, and flaky-test detection systems.
  • Mentor senior validation engineers and elevate technical standards through design reviews and written guidance.

What we're looking for

  • 5+ years of senior-level experience in software validation or quality engineering.
  • BS/MS/PhD in Computer Science, Engineering, or related field.
  • Expertise in Python for test automation and C++ for debugging complex systems.
  • Deep validation experience in GPU compute stacks, deep-learning frameworks, HPC runtimes, Linux kernel/GPU drivers, distributed systems, or large-scale cluster software.
  • Proven ability to define and implement release qualification programs for Tier-1 customers.
  • Mastery of GitHub at scale for quality engineering practices.
  • Strong command of modern agile software development practices applied to validation.

More like this

Similar roles