Lead Systems Operations Engineer
At a glance
AI-generated TL;DR
The Lead Systems Operations Engineer role within the Technology Operations team requires a seasoned professional with extensive experience in Kubernetes and OpenShift platform operations. This senior-level position involves leading complex initiatives, providing high-level systems consultation, and driving operational excellence to improve stability and automation. Day-to-day responsibilities include managing cluster maintenance, performance monitoring, incident response, and developing Python-based automation tools to streamline run processes. The ideal candidate will have deep expertise in Linux system administration, Python scripting for platform operations, and using tools like Grafana, Splunk, and Prometheus for observability solutions. They must also collaborate with various teams to ensure compliance and security while continuously identifying operational gaps and implementing improvements to enhance reliability and resiliency at an enterprise scale.
What you'll do
- Lead day-to-day operations of Redis and OpenShift platforms, including maintenance and troubleshooting.
- Drive rapid diagnosis and resolution during incidents, conducting root cause analysis and implementing corrective actions.
- Develop automation solutions using Python, Bash, GitOps workflows, and AI-assisted tools to streamline run processes.
- Ensure platform readiness through lifecycle activities such as new cluster builds, configuration, and decommissioning.
- Identify operational gaps and inefficiencies, leading initiatives to enhance reliability and resiliency of critical infrastructure services.
- Collaborate with security teams to ensure platform operations comply with organizational policies and regulatory requirements.
What we're looking for
- 5+ years of experience in systems engineering and platform operations automation using Python
- Deep expertise in managing complex, enterprise-scale applications and platforms like OpenShift and Kubernetes
- Strong proficiency in designing observability solutions with tools such as Grafana, Splunk, and Prometheus
- Extensive hands-on Linux system administration experience and cluster build-outs
- Ability to develop and enhance operational automation and AI-assisted tools
- Demonstrated leadership in incident response, root cause analysis, and continuous improvement initiatives
Employer
About Wells Fargo
Wells Fargo & Company is one of the largest banks in the United States, providing banking, investment, mortgage, and consumer and commercial finance products and services nationwide.
Industry: Banking & Financial Services
Wells Fargo currently has 63 open roles on FindRole.
Listed pay typically runs $119,000–$224,000 across 31 roles with salary data.
Most-posted roles
- Lead Software Engineer (4)
- Lead Systems Operations Engineer (3)
- Lead Information Security Engineer (2)
- Senior Systems Operations Engineer (2)
- Application Penetration Testing Senior Manager (1)