Principal ML Engineer - Large Scale Training Performance Optimization in San Jose, California | Advanced Micro Devices, Inc
AMD
Quick summary
Market check
How this pay compares to similar roles
This role pays more than 67% of similar roles. Most similar roles pay $177,737–$246,150. At the midpoint, this role pays about $240k versus about $212k for comparable roles.
Based on 240 similar postings.
Employer
AMD (Advanced Micro Devices) is a semiconductor company that develops high-performance processors, graphics cards, and adaptive computing solutions for gaming, data centers, and embedded markets. Industry: Semiconductors
AMD currently has 65 open roles on FindRole.
Listed pay is typically $188,000 across 65 roles with salary data.
At a glance
As a Principal GenAI Inference Optimization Engineer on the Models and Applications team, you will focus on enhancing performance, efficiency, and scalability of generative AI inference workloads on AMD GPU platforms. Your daily tasks include optimizing latency, throughput, and cost for large-scale model deployments in production environments, analyzing bottlenecks across compute, memory, and communication layers, and implementing advanced optimization techniques such as batching strategies and quantization. You will collaborate with hardware, compiler, and framework teams to drive cross-stack optimizations and contribute to the development of scalable serving systems using tools like vLLM, SGLang, Triton, or similar frameworks. Proficiency in Python, C++, CUDA/HIP, and experience with ML frameworks such as PyTorch are essential, along with a deep understanding of GPU architecture and performance fundamentals.
Skills
What you'll do
What we're looking for