Lead Systems Operations Engineer

Wells Fargo

Hybrid · Actively hiring
Chandler, AZ · Irving, TX · Charlotte, NC · Posted 4 days ago

At a glance

AI generated

TL;DR

The Lead Systems Operations Engineer role on the Technology Operations team calls for a seasoned professional with extensive experience in Kubernetes and OpenShift platform operations. This senior-level position involves leading complex initiatives, providing high-level systems consultation, and improving platform stability and automation. Day-to-day responsibilities include cluster maintenance, performance monitoring, incident response, and building Python-based automation tools to streamline operational run processes. The ideal candidate has deep expertise in Linux system administration, Python scripting for platform operations, and observability tooling such as Grafana, Splunk, and Prometheus. They will also collaborate with other teams to ensure compliance and security, while identifying operational gaps and implementing improvements to reliability and resiliency at enterprise scale.

Skills

Python · Kubernetes · OpenShift · Grafana · Prometheus · Splunk · CI/CD · GitOps · Linux · Redis · MCP · AI · Jira · GitHub

What you'll do

  • Lead day-to-day operations of Redis and OpenShift platforms, including maintenance and troubleshooting.
  • Drive rapid diagnosis and resolution during incidents, conducting root cause analysis and implementing corrective actions.
  • Develop automation solutions using Python, Bash, GitOps workflows, and AI-assisted tools to streamline run processes.
  • Ensure platform readiness through lifecycle activities such as new cluster builds, configuration, and decommissioning.
  • Identify operational gaps and inefficiencies, leading initiatives to enhance reliability and resiliency of critical infrastructure services.
  • Collaborate with security teams to ensure platform operations comply with organizational policies and regulatory requirements.

What we're looking for

  • 5+ years of experience in systems engineering and platform operations automation using Python.
  • Deep expertise in managing complex, enterprise-scale applications and platforms like OpenShift and Kubernetes.
  • Strong proficiency in designing observability solutions with tools such as Grafana, Splunk, and Prometheus.
  • Extensive hands-on experience with Linux system administration and cluster build-outs.
  • Ability to develop and enhance operational automation and AI-assisted tools.
  • Demonstrated leadership in incident response, root cause analysis, and continuous improvement initiatives.

Employer

About Wells Fargo

Wells Fargo & Company is one of the largest banks in the United States, providing banking, investment, mortgage, and consumer and commercial finance products and services nationwide.

Industry: Banking & Financial Services

Wells Fargo currently has 63 open roles on FindRole.

Listed pay typically runs $119,000–$224,000 across 31 roles with salary data.
