Senior Kubernetes Platform Engineer - AI/ML Infrastructure

Cisco

Remote · Hybrid · Actively hiring
Remote · Research Triangle Park, NC · Dallas, TX · Allen, TX · Posted 14 days ago · $137,000–$200,500 / year

At a glance

AI generated

TL;DR

As a Senior Kubernetes Platform Engineer on the Platform Engineering team, you will design and operate large-scale on-prem Kubernetes infrastructure for AI/ML workloads, including GPU-enabled environments. Your day-to-day responsibilities include architecting scalable multi-tenant platforms, enabling ML workflows, and building platform extensions using Golang-based services and Kubernetes controllers. You will also drive AIOps capabilities, improve observability, and optimize resource utilization while working closely with data scientists and infrastructure teams to ensure reliability across complex distributed systems. The role requires deep expertise in Kubernetes internals, etcd management, and hands-on experience with OpenShift/Anthos, Go programming, and AI/ML platforms like Kubeflow. This senior individual contributor position focuses on platform ownership and engineering excellence within a highly scalable and reliable infrastructure context.

Skills

Kubernetes · Go · etcd · Infrastructure as Code · AIOps · Observability · Metrics · Logs · Traces · Kubeflow · MLflow · Distributed systems · On-call rotations · Bare-metal infrastructure · OpenShift · Anthos · Prometheus · Grafana · CI/CD

What you'll do

  • Architect and build large-scale on-prem Kubernetes platforms for AI/ML workloads.
  • Define and evolve scalable multi-tenant platform architecture supporting GPU-based workloads.
  • Enable and optimize ML training and inference pipelines on Kubernetes infrastructure.
  • Implement Infrastructure as Code to enhance scalability and operational efficiency.
  • Drive AIOps capabilities using telemetry, automation, and self-healing systems.
  • Improve observability by optimizing metrics, logs, traces, and resource utilization.

What we're looking for

  • 8+ years of software engineering experience with a focus on Kubernetes.
  • Extensive hands-on experience managing large-scale Kubernetes control planes and etcd.
  • Proficiency in Go for building Kubernetes controllers, operators, CRDs, and webhooks.
  • Deep understanding of Kubernetes internals including API server, scheduler, and reconciliation loops.
  • Experience supporting AI/ML workloads, particularly GPU-based systems on Kubernetes.
  • Proven ability to operate and debug large-scale distributed systems and participate in on-call rotations.

Market check

Salary context

This $137,000–$200,500 range sits above 29% of similar postings on FindRole.

Peer median band

$159,575–$255,800

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$161,125–$246,150

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Cisco

Cisco Systems is the world's leading networking technology company, designing and manufacturing networking hardware, telecommunications equipment, and cybersecurity solutions for businesses and governments.

Industry: Networking Technology & Cybersecurity

Cisco currently has 103 open roles on FindRole.

Listed pay typically runs $165,000–$241,400 across 103 roles with salary data.

More like this

Similar roles

Senior Kubernetes Platform Engineer - AI Infrastructure

Cisco

Remote (Usa-Research Triangle Park, US) · 14 days ago · $137,000–$200,500
Kubernetes · OpenShift · Anthos · etcd · Go · Infrastructure as Code · AIOps · telemetry · Prometheus · Grafana · Kubeflow · MLflow · CI/CD · Docker · GitOps · Terraform · Ansible · Python · PostgreSQL
Remote

Kubernetes Platform Engineer - AI Infrastructure

Cisco

Remote (Usa-Research Triangle Park, US) · 14 days ago · $126,500–$182,000
Kubernetes · OpenShift · Anthos · Golang · Python · etcd · Infrastructure as Code · AIOps · Prometheus · Grafana · CI/CD · GPU · ML pipelines · CRDs · Webhooks · Observability · On-call support
Remote

Kubernetes Platform Engineer – AI Infrastructure

Cisco

Remote (Usa-San Jose, US) · 14 days ago · $152,500–$219,200
Kubernetes · OpenShift · Anthos · etcd · Golang · Python · Infrastructure as Code · AIOps · CRDs · Controllers · Operators · Webhooks · GPU-based workloads · AI/ML pipelines · Observability · Telemetry · CI/CD
Remote

Kubernetes Platform Engineer (IT Engineer Senior)

Qualcomm

San Diego, CA, US · 30 days ago
Kubernetes · Rancher · RKE2 · GKE · EKS · AKS · Cilium · Docker · ContainerD · git · Github · Python · Go · bash · JIRA · CKAD · CKA · CKS · Portworx · MetalLB · Github Actions · CI/CD

Senior Kubernetes Software Engineer

Broadcom

Usa-Ca - Promontory B, US · 51 days ago · $120,000–$192,000
Kubernetes · Go · CNCF · CI/CD · vSphere · Docker · Terraform · AWS · GCP · Azure · PostgreSQL · Prometheus · GitLab · GitHub · Maven · Jenkins · Ansible · Python · Shell scripting