Principal AI Inference Systems Engineer
AMD
Quick summary
Market check
How this pay compares to similar roles
This listing doesn't post a salary. Most similar roles pay $173,200–$246,150.
Based on 240 similar postings.
Employer
AMD (Advanced Micro Devices) is a semiconductor company that develops high-performance processors, graphics cards, and adaptive computing solutions for gaming, data centers, and embedded markets.
Industry: Semiconductors
AMD currently has 56 open roles on FindRole.
At a glance
As a Principal AI Infrastructure Solution Engineer at AMD, you will join the AI software team to design and validate Kubernetes architectures for large-scale LLM training and inference on AMD Instinct GPUs. Day to day, you will architect distributed training stacks, implement gang scheduling, and optimize GPU orchestration with tools such as Kubeflow Training Operator and SLURM controllers. You will work closely with enterprise customers to deploy production-ready AMD GPU clusters, benchmark performance, and write tuning guides covering efficient communication and workload-specific optimizations. The role requires expertise in Kubernetes GPU orchestration and distributed training on Kubernetes, plus hands-on experience running AI infrastructure at scale. It suits someone with a strong background in deploying large-scale GPU clusters and guiding customers through complex platform deployments.