AI Accuracy Architect

Qualcomm

Actively hiring
San Diego, CA Posted 77 days ago $158,400–$237,600 / year

At a glance

AI generated

TL;DR

As a Staff Engineer – AI Accuracy Architect at Qualcomm Technologies, you will lead accuracy-centric architecture and optimization for large language models (LLMs), vision-language models (VLMs), and multimodal models, collaborating closely with compiler, performance, and model optimization teams. Day to day, you will design quantization strategies, including post-training quantization (PTQ), quantization-aware training (QAT), mixed precision, and per-channel or group-wise schemes, that preserve accuracy within performance and hardware constraints. You will analyze numerical stability issues across the inference stack, from kernels to runtimes, and work with PyTorch and ONNX for model conversion and deployment. The role requires expertise in transformer architectures, attention mechanisms, and mixed-precision training, along with strong Python skills and a solid understanding of computer architecture and ML accelerators.

Skills

Python, PyTorch, ONNX, LLMs, VLMs, Quantization, Transformer architectures, Attention mechanisms, Precision trade-offs, Numerical stability, Accuracy evaluation metrics, ML compilers, torch.compile, Computer architecture, ML accelerators

What you'll do

  • Own architecture for LLM, VLM, and multimodal inference accuracy.
  • Lead Day-0 enablement of cutting-edge models on Qualcomm AI platforms.
  • Design and evaluate quantization strategies to balance accuracy and performance.
  • Analyze and resolve numerical stability issues across the inference stack.
  • Define and implement accuracy evaluation metrics and tooling.

What we're looking for

  • Extensive hands-on experience with LLMs and VLMs in production environments.
  • Expert understanding of quantization techniques and numerical precision trade-offs.
  • Deep knowledge of transformer architectures and attention mechanisms.
  • Proven ability to balance accuracy, performance, and hardware constraints.
  • Experience across compiler, kernel, and hardware abstraction layers.
  • Strong Python skills for scaling accuracy experiments and evaluations.

Market check

Salary context

This $158,400–$237,600 range sits above 49% of similar postings on FindRole.

Peer median band

$161,350–$241,500

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$162,000–$240,800

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Qualcomm

Qualcomm is a leading American semiconductor and telecommunications company based in San Diego, CA.

Qualcomm currently has 564 open roles on FindRole.

Listed pay typically runs $148,300–$224,400 across 531 roles with salary data.

More like this

Similar roles

AI Solution Architect

Booz Allen Hamilton

Nellis AFB, Nevada, US 18 days ago $112,800–$257,000
Palantir Foundry, Palantir Gotham, Kubernetes, DevSecOps, CI/CD, Docker, LLM, AI/ML, DevOps, Secret clearance, Top Secret clearance, AWS

AI Architect

Fiserv

Alpharetta, Georgia, US 15 days ago
Python, TensorFlow, PyTorch, Pandas, AWS, Git, DevOps, APIs, Microservices, Splunk, SSRS, Cognos, Power BI, Azure, Google Cloud, Kubernetes, CI/CD

Artificial Intelligence Solutions Architect

Booz Allen Hamilton

US 66 days ago $69,400–$158,000
Python, PySpark, Hive, SageMaker Studio, Bedrock, AWS, LLMs, GenAI, Databricks, CI/CD, Kubernetes, Terraform, GCP, Azure

Solutions Architect, AI Models

Nvidia

Remote (Santa Clara, CA, US) 41 days ago $152,000–$241,500
Python, PyTorch, TensorFlow, Hugging Face Transformers, Kubernetes, SLURM, Docker, CI/CD, Prometheus, Grafana, PostgreSQL, Git, Jupyter Notebook, NVIDIA NeMo, NVIDIA Nemotron, Linux, AWS, Azure, Google Cloud Platform

AI Systems Engineer and Solutions Architect

Booz Allen Hamilton

McLean, Virginia, US 66 days ago $112,800–$257,000
Python, Java, C++, C#, MBSE, AI applications, Autonomous platforms, SysML, UML, Cloud architecture, CI/CD

Solutions Architect, Applied AI Builder

Nvidia

Santa Clara, CA, US 55 days ago $152,000–$241,500
Python, TypeScript, Go, Rust, C++, Synthetic data generation, GPU-backed inference systems, Multi-agent workflows, Orchestration patterns, Complex long-running task systems, MCP, A2A-style communication patterns, Secure execution, Sandboxing, Secrets handling, Auditability, CI/CD