LLM Serving Engineer (Cloud AI Engineering), Senior / Staff Engineer
Qualcomm
At a glance
AI generated
As an AI Performance Engineer at Qualcomm Technologies, you will join a team developing hardware and software solutions for Cloud AI inference acceleration. Your day-to-day responsibilities include converting and optimizing models using PyTorch and ONNX, analyzing the performance of large language and vision models, and mapping next-generation workloads onto current and future hardware designs. You will collaborate closely with internal teams and customers on engineering solutions that improve the efficiency and scalability of AI workloads. Essential skills for this role include hands-on experience building and optimizing language models, a deep understanding of transformer architectures and attention mechanisms, proficiency in Python, and knowledge of computer architecture and ML accelerators. Familiarity with machine learning compilers such as torch.compile or TorchDynamo, and expertise in neural network operators and mathematical operations, are a plus.
Market check
This $178,400–$267,600 range sits above 57% of similar postings on FindRole.
Peer median band
$170,800–$258,500
Median floor and ceiling across peers.
Typical midpoint (25–75%)
$177,250–$246,150
Middle half of comparable postings.
Based on 240 comparable postings.
* 240 is the maximum number of comparable postings sampled.
Employer
Qualcomm is a leading American semiconductor and telecommunications company based in San Diego, CA.
Qualcomm currently has 567 open roles on FindRole.
Listed pay typically runs $148,300–$226,100 across 534 roles with salary data.