Principal AI Performance Engineer

Amd

Hybrid

Quick summary

Work type: Hybrid
Location: San Jose, CA
Posted: 108 days ago
Closes: Mar 11, 2027
Nearby: 99+ roles within 25 mi

Market check

Salary context

How this pay compares to similar roles

Similar $207k

$158k most similar roles pay here $256k

This listing doesn't post a salary. Most similar roles pay $167,550–$246,150.

Based on 240 similar postings.

Employer

About Amd

AMD (Advanced Micro Devices) is a semiconductor company that develops high-performance processors, graphics cards, and adaptive computing solutions for gaming, data centers, and embedded markets. Industry: Semiconductors

Amd currently has 56 open roles on FindRole.

Most-posted roles

View all roles at Amd

At a glance

TL;DR · Principal AI Performance Engineer

Role Posting Log in to save

AMD seeks a Principal AI Performance Engineer to lead a small technical team in optimizing AI inference performance on AMD GPUs for strategic customer engagements. This role involves end-to-end stack optimization of leading models and configurations, from profiling and diagnosing kernel-level bottlenecks to presenting optimizations to senior stakeholders. The ideal candidate has deep expertise in GPU computing, AI serving frameworks like vLLM and SGLang, and proficiency with Python and C++. They must excel at customer-facing technical leadership, leveraging AI agents daily to enhance workflows while developing reusable optimization methodologies. This position demands a performance-obsessed mindset, tackling complex challenges across multi-node distributed systems and leaving measurable impacts on AMD’s competitive edge in the AI market.

Skills

Python C++ vLLM SGLang TensorRT-LLM HIP CUDA Triton CK Linux GPU AI agents CI/CD PyTorch Kubernetes

What you'll do

Drive end-to-end performance optimization on AMD GPUs for leading AI models.
Profile and resolve complex cross-stack bottlenecks in GPU kernels and frameworks.
Diagnose kernel-level issues using profiling tools to enhance model performance.
Lead customer engagements by presenting technical findings and optimizations.
Develop custom kernels within serving frameworks to improve dispatch efficiency.
Optimize multi-node distributed inference for better communication-compute overlap.
Define and refine performance optimization methodologies for the broader team.

What we're looking for

7+ years of software development experience in GPU computing, AI systems, or high-performance computing.
Deep hands-on experience with AI serving frameworks and their internals, including vLLM, SGLang, TensorRT-LLM.
Strong background in end-to-end workload profiling and bottleneck diagnosis from user request to GPU kernel.
Expertise in GPU kernel performance characteristics such as occupancy, memory coalescing, cache utilization, and instruction-level bottlenecks.
Experience with custom kernel development or integration using HIP, CUDA, Triton, CK, or similar technologies.
Understanding of multi-GPU and multi-node distributed systems, including scale-up and scale-out topologies, RDMA, and communication-compute overlap.
Fluent in AI-assisted development, leveraging AI agents and tools daily to accelerate workflows.

Similar roles

Principal Software Development Engineer, AI Performance

Amd

San Jose, CA 121 days ago

CUDA HIP Python C++ LLVM MLIR Triton Gluon PyTorch vLLM SGLang xDiT Megatron LM Linux GPU HPC AI systems roofline analysis performance engineering multi-GPU communication

Hybrid

Save

AI Systems Performance Engineer

Broadcom

San Jose, CA 69 days ago $141,300–$226,000

Linux Python C++ PyTorch MLPerf NCCL Ethernet RDMA RoCEv2 CI/CD Docker Kubernetes

Save

Principal AI Engineer

Salesforce

Remote (San Francisco, CA) +4 30 days ago $197,300–$313,700

AWS Python GitHub Actions ArgoCD Terraform Docker Kubernetes Grafana Braintrust LangSmith CI/CD AgentOps Salesforce Ecosystem Vector Databases Graph Databases RAG Pipelines Snowflake Kafka Flink

Remote

Save

Principal AI Engineer

Salesforce

New York +4 31 days ago $218,400–$365,200

Salesforce Distributed Systems CI/CD Infrastructure-as-Code API Integration AI Agents LLM Workflows Automated Testing Observability Event-Driven Design Microservices Security & Compliance Prompt Engineering System Context Design Evaluation Frameworks GitHub Copilot Claude Code Cursor Salesforce Marketing Cloud Agentforce Google Workspace Slack

Save

Principal AI Development Engineer

Broadcom

San Jose, CA 67 days ago $118,800–$190,000

Python Kubernetes Docker CI/CD TDD GitHub LangGraph CrewAI Pinecone Weaviate Milvus Vector_Databases Frontend_Development Backend_Development Test_Automated Version_Control Modern_Enterprise_Grade_Languages

Save