Senior AI/ML Platform Engineer (LLM/SLM Inference)

Cisco

Actively hiring
Remote (USA-San Jose, US) Posted 44 days ago $199,700–$254,600 / year

At a glance

AI generated

TL;DR

As a Senior AI/ML Platform Engineer on Cisco’s CX AI Incubation Team, you will productionize large and small language models (LLMs/SLMs) for intelligent customer experiences across cloud and on-prem environments. Day-to-day responsibilities include building robust deployment pipelines with clear SLAs, optimizing inference performance across hardware configurations, packaging on-prem inference stacks securely, designing scalable serving architectures, and implementing model observability. You will work closely with product and engineering teams to ensure reliable and secure AI services, using technologies such as PyTorch/TensorFlow, Kubernetes, CI/CD pipelines, and inference runtimes such as vLLM and TensorRT-LLM. The role calls for expertise in Python, Java, or C++, hands-on experience with NLP systems, and a track record of operationalizing models at scale, particularly in resource-constrained environments.

Skills

Python, PyTorch, TensorFlow, Kubernetes, CI/CD, MLOps, Prometheus, Grafana, Docker, Git, AWS, Azure, Google Cloud Platform, PostgreSQL, MySQL, NVIDIA GPUs, vLLM, Triton, TensorRT-LLM, llama.cpp

What you'll do

  • Build and deploy robust model-serving pipelines for LLM/SLM features with clear SLAs.
  • Optimize inference performance across various hardware configurations for cost efficiency.
  • Package on-prem inference stacks securely and integrate them into customer environments.
  • Design scalable serving architectures for multi-tenant, secure generative AI systems.
  • Implement automated CI/CD processes for models and prompts to ensure reproducibility.
  • Develop model and service observability features including latency metrics and safety checks.
  • Support training and fine-tuning workflows for LLMs/SLMs with data curation and tracking.

What we're looking for

  • 7+ years of experience in AI/ML DevOps or related field.
  • Strong background in Python, Java, and C++ for building production services.
  • Experience with PyTorch/TensorFlow and ML lifecycle tooling.
  • Proven track record deploying and operating NLP/Generative AI systems.
  • Hands-on expertise in GPU-backed inference and runtime optimization.
  • Familiarity with Kubernetes, CI/CD pipelines, model registries, and observability tools.
  • Ability to work effectively in cross-functional teams and communicate technical concepts.

Market check

Salary context

This $199,700–$254,600 range sits above 61% of similar postings on FindRole.

Peer median band

$169,375–$261,300

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$176,000–$246,150

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Cisco

Cisco Systems is the world's leading networking technology company, designing and manufacturing networking hardware, telecommunications equipment, and cybersecurity solutions for businesses and governments.

Industry: Networking Technology & Cybersecurity

Cisco currently has 113 open roles on FindRole.

Listed pay typically runs $165,000–$241,400 across 113 roles with salary data.

More like this

Similar roles

Senior ML/AI Engineer

Genworth Financial

US 27 days ago $114,900
Python, Databricks, MLflow, Spark, Delta Lake, Feature Store, CI/CD, MLOps, A/B Testing, Kubernetes, AWS, Azure, SQL, LLM, RAG, Prometheus, Grafana

Senior Software Engineer - Applied AI/ML

Motorola Solutions

Chicago, IL, US 15 days ago $135,000–$155,000
Python, SQL, Docker, Kubernetes, AWS, Azure, GCP, MLOps, CI/CD, PyTorch, TensorFlow, Databricks, MLflow, AWS SageMaker, Hugging Face, Apache Airflow, Temporal, RF, Ray

Senior AI/ML Engineer, Build Platform

General Motors (GM)

Remote (Mountain View Technical Center, US) 83 days ago $170,000–$240,000
Python, Docker, Bazel, RBE, CAS, CI/CD, Kubernetes, AWS, Git, C++, Go, Prometheus, Grafana

Senior Machine Learning Engineer, AI Platform

Adobe

San Jose, US 16 days ago $211,800–$306,625
Python Java C++ Cloud Infrastructure Distributed Computing Deep Learning Virtual Reality Augmented Reality Artificial Intelligence Robotics Interactive Experiences Large-Scale Computing Frameworks Data Analysis Systems Modeling Environments