Senior AI Research Quantization Engineer

Qualcomm

Actively hiring
San Diego, CA Posted 47 days ago $140,800–$211,200 / year

At a glance

AI generated

TL;DR

Qualcomm AI Research seeks a senior algorithm engineer to join its high-caliber team focused on developing advanced machine learning technologies and user-friendly model optimization tools like the Qualcomm Innovation Center’s AI Model Efficiency Toolkit. This role involves cutting-edge research in efficient generative AI, LLMs, LVMs, and multi-modal systems, with a focus on optimizing inference algorithms, quantization techniques, and model compression for both device and cloud environments. The ideal candidate will have expertise in Python and PyTorch, along with experience in machine learning algorithm development or systems engineering. This position offers the opportunity to collaborate across hardware, software, and systems disciplines, contributing to innovations that power next-generation smartphones, autonomous vehicles, robotics, and IoT devices.

Skills

Python, PyTorch, LLM, LVM, Multi-modal, VLA, Batching, KV caching, Efficient attentions, Long context, Speculative decoding, Quantization algorithms, Gradient-based optimization, Non-gradient based optimization, Equivalent transformation, Non-equivalent transformation, Automatic mixed precision, Hardware in loop, Model compression, Lossy compression, Lossless compression, Structural search, Neural search, Generative AI system prototyping

What you'll do

  • Develop efficient generative AI algorithms for on-device and cloud applications.
  • Design advanced quantization techniques to optimize complex generative models.
  • Conduct research on model compression methods to enhance efficiency.
  • Prototype systems for generative AI to improve performance and usability.
  • Implement inference algorithms to enable efficient processing of large datasets.
  • Collaborate on system innovations to advance model efficiency across devices.

What we're looking for

  • Experience in developing efficient generative AI algorithms and large language models.
  • Proficiency in advanced quantization techniques for complex generative models.
  • Expertise in model compression methods including lossy and lossless techniques.
  • Strong background in Python and PyTorch programming for machine learning.
  • Knowledge of efficient inference algorithms such as batching, KV caching, and efficient attention mechanisms.
  • Experience with system prototyping for on-device and cloud-based AI solutions.
  • Master's or PhD degree in Computer Science, Engineering, or related field.

Market check

Salary context

This $140,800–$211,200 range sits above 26% of similar postings on FindRole.

Peer median band

$171,700–$261,300

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$175,375–$246,150

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Qualcomm

Qualcomm is a leading American semiconductor and telecommunications company based in San Diego, CA.

Qualcomm currently has 595 open roles on FindRole.

Listed pay typically runs $148,300–$222,500 across 540 roles with salary data.

More like this

Similar roles

Senior Quantum AI Research Scientist, Applied Research

Nvidia

Redmond, WA 11 days ago $192,000–$304,750
Python, PyTorch, CUDA, NVIDIA GPUs, Quantum Information Science, Deep Learning, Machine Learning, Graph Neural Networks, Reinforcement Learning, High-Performance Computing, Distributed Training Frameworks, HPC Environments, LoRA, QLoRA, Adapters, Superconducting Qubits, Trapped Ions, Fault-Tolerant Quantum Systems
Hybrid

Senior AI Scientist

Intuit

Mountain View, CA 45 days ago $173,500–$234,500
Python, scikit-learn, R, SQL, Hive, SparkSQL, Linux, data mining, clustering, classification, regression, decision trees, neural nets, support vector machines, anomaly detection, recommender systems, sequential pattern discovery, text mining, A/B testing, statistical analysis

Senior Applied AI Engineer

Equifax

Alpharetta, GA 21 days ago
GCP, Kubernetes, Terraform, Java, Spring Boot, Apache Beam, Bigtable, BigQuery, PubSub, GCS, Composer, Airflow, Jenkins, Helm, CI/CD, Python, SQL, NLP, LLM, CRMs, APIs, Event-driven architectures, MLOps, Docker, Prometheus, Grafana
Hybrid

Senior AI Machine Learning Engineer

The Hartford

Chicago, IL 13 days ago $117,200–$175,800
AWS, GCP, SageMaker, Streamlit, Python, Java, C#, Hadoop, Spark, Redshift, Snowflake, BigQuery, Jenkins, Terraform, GitHub, GitHub Actions, Apache Airflow, Kubernetes, Docker, SQL, CI/CD, MLOps
Hybrid

Senior Distinguished AI Engineer

Capital One Financial

San Francisco, CA 68 days ago $314,800–$359,300
Python, Go, Scala, Java, CI/CD, AWS, Kubernetes, Terraform, Docker, Prometheus, Grafana

Senior Staff AI Research Scientist

Intuit

Mountain View, CA 46 days ago $226,000–$306,000
Python, PyTorch, TensorFlow, NeurIPS, ICML, ICLR, AAAI, KDD, ACL, Decision-focused AI, Probabilistic modeling, Causal inference, Simulation-based planning, Agentic and multi-agent systems, Neuro-symbolic AI, LLM-based reasoning, Deep learning, Optimization, Statistical machine learning