LLM Serving Engineer (Cloud AI Engineering), Senior / Staff Engineer
Qualcomm
Quick summary
Market check
How this pay compares to similar roles
This role pays more than 82% of similar roles. Most similar roles pay between $162,000 and $246,150. At the midpoint, this role pays about $251k versus about $204k for comparable roles.
Based on 239 similar postings.
Employer
Qualcomm is a leading American semiconductor and telecommunications company based in San Diego, CA.
Qualcomm currently has 621 open roles on FindRole.
Listed pay typically runs $148,300–$224,400 across 562 roles with salary data.
At a glance
The Qualcomm Cloud AI team is hiring an experienced engineer to contribute to the development of software solutions for inference acceleration. This role involves working across the entire product lifecycle from R&D to commercial deployment, requiring strong communication and cross-functional collaboration skills. The ideal candidate will have a track record of delivering large-scale commercial software projects and experience with frameworks like vLLM. Key responsibilities include designing, compiling, and optimizing neural networks for multicore systems, as well as performance modeling of SoC architectures. Proficiency in PyTorch, C++, and Python and an understanding of multi-core processor architecture are essential, along with expertise in LLMs, multi-modal models, and reasoning models. This position demands deep knowledge of machine learning accelerators and neural network operators, making it ideal for those passionate about advancing AI inference technology at scale.