Kubernetes Platform Engineer - AI Infrastructure

Cisco

Remote · Hybrid · Actively hiring
Remote · Research Triangle Park, NC · Dallas, TX · Allen, TX · Posted 14 days ago · $126,500–$182,000 / year

At a glance

AI generated

TL;DR

As a Kubernetes Platform Engineer on Cisco’s AI Infrastructure team, you will design and operate large-scale on-prem Kubernetes platforms to support next-generation AI/ML workloads, including GPU-enabled environments for training and inference. Your responsibilities include architecting scalable multi-tenant infrastructure, building platform capabilities with custom controllers and operators in Golang, and implementing Infrastructure as Code practices. You’ll collaborate closely with data scientists and ML engineers to optimize workflows and keep AI workloads performing well, using AIOps for automation and reliability. This role requires hands-on experience in Kubernetes control plane management and etcd operations, deep knowledge of Kubernetes internals, proficiency in Go, and strong debugging skills for large-scale distributed systems.

Skills

Kubernetes · OpenShift · Anthos · Golang · Python · etcd · Infrastructure as Code · AIOps · Prometheus · Grafana · CI/CD · GPU · ML pipelines · CRDs · Webhooks · Observability · On-call support

What you'll do

  • Design and build large-scale on-prem Kubernetes platforms for AI/ML workloads.
  • Architect scalable infrastructure to support multi-tenant environments for AI applications.
  • Enable high-performance GPU-based AI/ML workloads in Kubernetes clusters.
  • Implement Infrastructure as Code using Golang, CRDs, and Kubernetes controllers.
  • Ensure reliability through performance tuning and participation in on-call rotations.
  • Build platform capabilities with custom operators and services for ML pipelines.

What we're looking for

  • 5+ years of software engineering experience with AI/ML or GPU-based workloads on Kubernetes.
  • 3+ years operating Kubernetes in production, including control plane ownership and cluster upgrades.
  • Strong etcd management skills for backup, restore, and recovery operations.
  • Proficiency in Go for building Kubernetes controllers/operators, CRDs, and webhooks.
  • Deep understanding of Kubernetes internals like API server, scheduler, and controller loops.

Market check

Salary context

This $126,500–$182,000 range sits above 25% of similar postings on FindRole.

Peer median band

$152,000–$234,800

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$155,000–$235,750

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Cisco

Cisco Systems is the world's leading networking technology company, designing and manufacturing networking hardware, telecommunications equipment, and cybersecurity solutions for businesses and governments.

Industry: Networking Technology & Cybersecurity

Cisco currently has 103 open roles on FindRole.

Listed pay typically runs $165,000–$241,400 across 103 roles with salary data.

More like this

Similar roles

Kubernetes Platform Engineer – AI Infrastructure

Cisco

Remote (USA-San Jose, US) · 14 days ago · $152,500–$219,200
Kubernetes · OpenShift · Anthos · etcd · Golang · Python · Infrastructure as Code · AIOps · CRDs · Controllers · Operators · Webhooks · GPU-based workloads · AI/ML pipelines · Observability · Telemetry · CI/CD

Senior Kubernetes Platform Engineer - AI Infrastructure

Cisco

Remote (USA-Research Triangle Park, US) · 14 days ago · $137,000–$200,500
Kubernetes · OpenShift · Anthos · etcd · Go · Infrastructure as Code · AIOps · Telemetry · Prometheus · Grafana · Kubeflow · MLflow · CI/CD · Docker · GitOps · Terraform · Ansible · Python · PostgreSQL

Senior Kubernetes Platform Engineer - AI/ML Infrastructure

Cisco

Remote (USA-Research Triangle Park, US) · 14 days ago · $137,000–$200,500
Kubernetes · Go · etcd · Infrastructure as Code · AIOps · Observability · Metrics · Logs · Traces · Kubeflow · MLflow · Distributed systems · On-call rotations · Bare-metal infrastructure · OpenShift · Anthos · Prometheus · Grafana · CI/CD

Kubernetes Platform Engineer (IT Engineer Senior)

Qualcomm

San Diego, CA, US · 30 days ago
Kubernetes · Rancher · RKE2 · GKE · EKS · AKS · Cilium · Docker · containerd · Git · GitHub · Python · Go · Bash · Jira · CKAD · CKA · CKS · Portworx · MetalLB · GitHub Actions · CI/CD