AI Platforms Leader Enterprise AI Platforms

Qualcomm

Actively hiring
San Diego, CA Posted 77 days ago $198,500$297,700 / year

At a glance

AI generated

TL;DR

Qualcomm is hiring an AI Platforms Leader to oversee the strategy, architecture, and operation of its comprehensive end-to-end AI Platform, encompassing both on-prem GPU clusters and cloud services across AWS, GCP, and Azure. This role involves leading a high-caliber engineering team in delivering reliable and cost-efficient infrastructure for training, fine-tuning, inference, retrieval, and agentic orchestration using MCP servers. Key responsibilities include defining the multi-year vision for a hybrid AI platform, optimizing GPU-based compute at scale, implementing MLOps and LLMOps as products, and ensuring operational excellence with robust DevOps practices such as GitOps and IaC (Terraform/Bicep/Helm). The ideal candidate has extensive experience in building large-scale platforms, leading engineering teams, and hands-on expertise with GPU clusters, Kubernetes, and cloud AI/ML services. Knowledge of tools like PyTorch, Triton Inference Server, and MCP is preferred, along with a strong background in MLOps, security, and global collaboration.

Skills

AWS Azure GCP Kubernetes MLOps LLMOps CI/CD Terraform GitOps IaC Docker Prometheus Grafana MCP Slurm PyTorch CUDA cuDNN Triton_Inference_Server vLLM KServe Ray MLflow Vertex SageMaker Airflow Argo Weights_and_Biases LangSmith OIDC RBAC OPA AWS_Secrets_Manager Azure_Key_Vault FAISS Milvus Pinecone Feast

What you'll do

  • Own the multi-year vision for a hybrid AI platform, aligning to business needs, developer productivity, and cost efficiency.
  • Operate and optimize on-prem GPU clusters for high-throughput storage and networking.
  • Deliver MLOps and LLMOps as self-service product capabilities with CI/CD automation.
  • Design and operate agentic orchestration systems and MCP servers for secure enterprise tool integration.
  • Establish multi-cloud patterns for AI/ML platforms, ensuring portability and resilience across AWS/GCP/Azure.
  • Lead a global engineering team in platform services development and DevOps excellence practices.

What we're looking for

  • 15+ years of engineering/technology experience with ~10 years in large-scale platform operations.
  • Proven leadership in managing a team of ~10 engineers for at least 5 years.
  • Expertise in operating on-prem GPU clusters and optimizing performance and reliability.
  • Extensive experience in MLOps, including model lifecycle management and observability.
  • Deep knowledge of cloud AI/ML services and managed Kubernetes across AWS/GCP/Azure.
  • Strong background in DevOps practices, including CI/CD, GitOps, IaC, and secure SDLC.

Market check

Salary context

This $198,500–$297,700 range sits above 80% of similar postings on FindRole.

Peer median band

$172,000$257,000

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$168,000$246,150

Middle half of comparable postings.

Based on 239 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Qualcomm

Qualcomm is a leading American semiconductor and telecommunications company based in San Diego, CA.

Qualcomm currently has 595 open roles on FindRole.

Listed pay typically runs $148,300–$222,500 across 540 roles with salary data.

Most-posted roles

View all roles at Qualcomm

More like this

Similar roles

AI Architecture & Governance Leader Enterprise AI Platforms

Qualcomm

San Diego, CA 77 days ago $192,600$289,000
Kubernetes LLM serving engines Agentic automation frameworks RPA Enterprise AI Governance MLOps Vector databases Feature stores Observability AI quality monitoring Security Compliance TOGAF GPU scheduling Cloud services APIs Eventing Microservices Identity/authorization Mobile backends Technical Product Manager DevSecOps

Head of AI Developer Platform

Blackrock

New York 46 days ago $275,000$350,000
CI/CD Kubernetes Docker Python Java PostgreSQL AWS Azure Grafana Prometheus GitLab Jenkins Terraform Ansible Chaos Engineering Responsible AI Context Graphs Test Data Management Pre-Prod Environment Management
Hybrid

Lead Principal Engineer, Enterprise Agentic AI Platform

Nvidia

Santa Clara, CA 95 days ago $272,000$431,250
Python Go Kubernetes LangChain LangGraph Terraform CI/CD Prometheus Grafana PostgreSQL Redis Docker GitLab AWS Azure NVIDIA硬件 GPU加速 容器化工作负载 网络API 安全企业集成模式 基准测试 回归测试 遥测系统 可观测性系统 性能调优 混合环境 向量数据库 检索系统 Glean Microsoft Copilot Studio Google Agentspace LangChain框架 多代理管理 持续集成/持续部署 SDK开发 API设计 参考实现 企业级AI生态系统 GPU推理系统优化

AI Platform Engineer, Senior

Booz Allen Hamilton

Laurel, Maryland 45 days ago $86,800$198,000
AWS Python Kubernetes Prometheus Grafana OpenTelemetry CI/CD

AI Governance Technology Lead

Global Payments (TSYS)

Alpharetta, GA 61 days ago
Python SQL AI Governance Platforms Risk Assessment Tools Synthetic Data Generation Automated Testing Frameworks Anomaly Detection AI Observability Tools Bias Audits Adversarial Testing Stress Testing Statistical Testing XAI Techniques CI/CD PostgreSQL MLOps EU AI Act GDPR CCPA NIST AI RMF