Sr. Staff Engineer, Machine Learning Engineering (Quantization SW)

Qualcomm

Actively hiring
San Diego, CA Posted 25 days ago $178,400$267,600 / year

At a glance

AI generated

TL;DR

Join Qualcomm Technologies, Inc.'s AI Research team as a Senior Software Engineer specializing in advancing Gen AI Technology for the Edge, focusing on model fine-tuning, hardware acceleration, and edge inference. You will work in a dynamic research environment alongside multi-disciplinary teams of researchers and software engineers using cutting-edge AI frameworks like PyTorch and ONNX. Your daily tasks include architecting, designing, developing, and testing techniques such as graph optimization, pruning, and quantization to enhance model performance on devices ranging from smartphones to autonomous vehicles. Ideal candidates possess strong software engineering skills, a solid foundation in AI and ML, hands-on experience with model optimization frameworks like Huggingface Optimum and OpenVino, and the ability to establish high-quality software delivery processes using industry best practices.

Skills

Python PyTorch ONNX Huggingface_Optimum OpenVino CI/CD Git Agile ML_frameworks Model_optimization Quantization Pruning Edge_inference Neural_networks Code_review Automation

What you'll do

  • Architect and design model optimization techniques for AI frameworks.
  • Develop and test graph optimization, pruning, and quantization methods.
  • Evaluate and optimize Generative AI workflows for performance and accuracy.
  • Implement ML model optimization using tools like PyTorch and ONNX.
  • Deploy GenAI LLM/LVM models on edge devices with strong Python skills.
  • Establish high-quality software delivery processes using industry best practices.

What we're looking for

  • Strong background in AI and machine learning techniques.
  • Proven experience evaluating and optimizing Generative AI workflows.
  • Hands-on experience with ML model optimization frameworks and techniques.
  • Expertise in Python design and implementation for AI projects.
  • Experience deploying GenAI models on edge devices and profiling them.
  • Knowledge of neural networks and familiarity with PyTorch, ONNX.
  • Strong software engineering skills and agile development experience.

Market check

Salary context

This $178,400–$267,600 range sits above 62% of similar postings on FindRole.

Peer median band

$155,420$241,500

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$169,463$244,000

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Qualcomm

Qualcomm is a leading American semiconductor and telecommunications company based in San Diego, CA.

Qualcomm currently has 567 open roles on FindRole.

Listed pay typically runs $148,300–$226,100 across 534 roles with salary data.

Most-posted roles

View all roles at Qualcomm

More like this

Similar roles

Sr Staff Machine Learning Engineer

PayPal

Usa - California - San Jose - Corp - N First St, US 73 days ago $218,000$323,950
Python Scikit-learn TensorFlow Keras Pandas NumPy AWS Google Cloud Platform Azure Docker Kubernetes CI/CD Git Jupyter Notebook PostgreSQL MongoDB

Senior Staff Machine Learning Engineer

Intuit

Mountain View, California, US 43 days ago $214,000$289,500
AWS GCP TensorFlow PyTorch Spark Kubernetes MLflow RAG LLM CI/CD MLOps Python Docker Prometheus PostgreSQL

Senior Staff Machine Learning Engineer

GEICO

Md Bethesda Office, US 32 days ago $150,000$300,000
Python AWS Azure Kubernetes Airflow Snowflake PostgreSQL MongoDB Cassandra Spark Ray MLflow Kubeflow Feast Prometheus Grafana OpenTelemetry CI/CD ElasticSearch Qdrant Parquet Delta Iceberg Flink SHAP LIME

Senior Staff Machine Learning Engineer

GEICO

Ca Palo Alto Office, US 39 days ago $150,000$300,000
Python Java C++ AWS Azure Kubernetes CI/CD Elasticsearch Snowflake Kafka PostgreSQL MongoDB Cassandra Spark Ray Airflow Temporal LLMs GPT Generative AI

Senior Staff Machine Learning Engineer

GEICO

Md Bethesda Office, US 32 days ago $150,000$300,000
Python Java C++ AWS Azure Kafka Spark Ray Airflow Temporal PostgreSQL MongoDB Cassandra ElasticSearch Qdrant Snowflake Parquet Delta Iceberg MLflow Kubeflow Feast Prometheus Grafana OpenTelemetry CI/CD Kubernetes