Senior Software Engineer – Application Reliability , Hybrid

Cisco

Remote Hybrid Actively hiring Verified listing
San Jose, CA · North Carolina · Remote, USA Posted 11 days ago $199,700$254,600 / year

At a glance

AI generated

TL;DR

As a Senior Software Engineer in Application Reliability at Cisco's Enterprise AI team in San Jose or North Carolina, you will focus on ensuring the reliability and observability of AI-powered applications and features from a user-centric perspective. You will define and enforce feature-level SLIs, SLOs, and error budgets while building LangGraph-based agents for automated diagnostics and Looker dashboards for real-time visibility using BigQuery and Python. Key responsibilities include developing agent evaluation harnesses, writing complex SQL queries, analyzing usage trends, and partnering with development teams to embed reliability practices into the development lifecycle. The role requires strong Python skills, GCP experience, and expertise in Kubernetes, BigQuery, and BigTable, along with a deep understanding of application-level observability frameworks and AIOps concepts. This position offers an exciting opportunity to work on cutting-edge AI technologies at scale within a collaborative team environment.

Skills

Python BigQuery GKE Kubernetes Looker BigTable SQL CI/CD Docker Prometheus Grafana Cloud Logging Cloud Trace Cloud Monitoring LangGraph AIOps MLOps GenAI Feature Flags Canary Deployments Progressive Rollouts Automated Rollback Structured Logging Distributed Tracing

What you'll do

  • Define and enforce feature-level SLIs, SLOs, and error budgets for AI-powered applications.
  • Build LangGraph-based agents for automated issue identification and remediation in AI systems.
  • Develop Python-based tooling to reduce mean time to detect (MTTD) and resolve (MTTR) application issues.
  • Write complex SQL queries on BigQuery for usage trend analysis and operational analytics.
  • Analyze application usage trends to proactively identify reliability risks and degraded user experiences.
  • Design and maintain Looker dashboards using BigQuery and BigTable for real-time feature observability.
  • Partner with development teams to embed reliability practices into the software development lifecycle.

What we're looking for

  • 10+ years of software engineering experience with a focus on reliability, observability, or production operations.
  • Strong Python development skills for building production tooling and automation.
  • Extensive GCP experience including Kubernetes (GKE), BigQuery SQL expertise, and BigTable hands-on experience.
  • Proven ability to design and operate application-level SLI/SLO frameworks and error budget policies.
  • Expert debugging skills at the application layer with distributed tracing, profiling, and log analysis.

Market check

Salary context

This $199,700–$254,600 range sits above 90% of similar postings on FindRole.

Peer median band

$117,000$208,150

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$140,400$197,175

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Cisco

Cisco Systems is the world''s leading networking technology company, designing and manufacturing networking hardware, telecommunications equipment, and cybersecurity solutions for businesses and governments. Industry: Networking Technology & Cybersecurity

Cisco currently has 103 open roles on FindRole.

Listed pay typically runs $165,000–$241,400 across 103 roles with salary data.

Most-posted roles

View all roles at Cisco

More like this

Similar roles

Senior Software Application Engineer

Qualcomm

San Diego, Ca,Us, US 15 days ago $108,300$162,500
C C++ Python Java Android OS Linux kernel ARM architecture CPU GPU DDR DSP BSP Profiling tools Analysis tools Debugging techniques Performance analysis Power analysis Thermal analysis System performance optimization Customer-facing experience Technical presentations Training sessions

Senior Software Engineer, (Hybrid)

Cisco

Remote (Usa-Research Triangle Park, US) 31 days ago $137,000$200,500
C C++ Linux Multicast IPv6 Quality-of-service Segmentation Segment Routing Routing Protocols OSPF BGP Controller Driven Architecture AI/ML Test Automation Static Analysis Tools WireShark
Remote

Senior Software Engineer

Apex

US 122 days ago
Java Python PostgreSQL jOOQ Bazel gRPC Protobuf Flyway PubSub Datadog AWS CI/CD SQL Agile Jira

Senior Software Engineer

Q2

Cary, North Carolina, US 70 days ago
.NET SQL Server C# HTML/CSS JavaScript LLM-based systems RAG fundamentals Vector search integration Chunking strategies Context window management Agentic patterns MVVM Vue Angular React Test automation frameworks SOLID principles Agile development CI/CD

Senior Software Engineer

Microsoft

Redmond, Wa,Us, US 115 days ago $119,800$234,700
Python Java Go C++ Docker Kubernetes AWS Azure CI/CD PostgreSQL MongoDB Redis GraphQL OAuth OpenIDConnect ZeroTrustArchitecture

Senior Software Engineer

Prudential Financial

Wash, 213 Washington St., Newark, Nj, US 91 days ago $104,000$171,600
React Springboot Docker Terraform AWS Kubernetes DevOps CI/CD GitHub Jenkins Python Java Node.js HTML CSS JavaScript DynamoDB ECS Lambda RDS S3 Observability Metric Logs Tracing Agile Methodology