Senior Scheduling Platform Production Engineer

Goldman Sachs

Quick summary

Work type
On-site
Location
Dallas, TX
Posted
1 day ago

Market check

Salary context

How this pay compares to similar roles

Similar $185k
$137k most similar roles pay here $230k

This listing doesn't post a salary. Most similar roles pay $149,850–$220,900.

Based on 240 similar postings.

Employer

About Goldman Sachs

Goldman Sachs is a leading global investment banking, securities, and investment management firm providing financial services to corporations, financial institutions, governments, and individuals.

Goldman Sachs currently has 187 open roles on FindRole.

Listed pay typically runs $130,000–$250,000 across 60 roles with salary data.

Most-posted roles

View all roles at Goldman Sachs

At a glance

TL;DR · Senior Scheduling Platform Production Engineer

As a Senior Scheduling Platform Production Engineer on the Runtime Platforms team, you will play a pivotal role in maintaining and enhancing the Procmon platform, which schedules millions of daily jobs for critical business functions across Goldman Sachs. Your responsibilities include identifying operational risks, refining runbooks to reduce manual tasks, building observability into new deployments, leading outage investigations, defining service level indicators and objectives, and facilitating migrations to newer platforms. You will work closely with users globally, participate in on-call rotations, and communicate effectively with both technical and non-technical stakeholders. The ideal candidate has extensive experience in DevOps or Production Engineering, proficiency in Linux systems, networking fundamentals, and real-time monitoring tools like Prometheus and Grafana. Programming skills in Go, Shell, Python, or Erlang are essential for this role within a highly regulated financial services environment.

What you'll do

  • Proactively identify operational risks, capacity concerns, and reliability gaps; implement remediation independently.
  • Refine runbooks, tooling, and automation to reduce manual tasks and enhance platform reliability.
  • Build observability for new deployments in collaboration with development teams from the start.
  • Lead real-time outage investigations and present detailed postmortems to senior management.
  • Define service level indicators (SLIs) and objectives (SLOs), ensuring systems meet SLAs.
  • Facilitate planned migrations from legacy platforms to the newest Procmon platform for users.

What we're looking for

  • Extensive experience (8+ years) in DevOps or Production Engineering, focusing on distributed computing systems.
  • Expertise in real-time monitoring and alerting tools like Prometheus for proactive incident detection and automated response.
  • Deep knowledge of Linux operating systems and system administration skills.
  • Experience with Cloud computing within enterprise environments.
  • Proficiency in programming languages such as Go, Shell, Python, or Erlang.
  • Strong communication skills to explain complex technical issues clearly to both technical and non-technical users.
  • Ability to operate effectively in a mission-critical, highly regulated financial services environment.

More like this

Similar roles

Vice President, Engineering - SRE Platforms

Goldman Sachs

Dallas, TX 1 day ago
Python Java Go AWS GCP Docker Kubernetes Terraform Puppet Chef Ansible Prometheus Grafana ELK_stack Datadog PagerDuty Jenkins GitLab Maven CI/CD Linux Networking Distributed_systems Elastic_Search Big_Query Kafka

Compliance Engineering, DevOps

Goldman Sachs

Dallas, TX 1 day ago
Python Java Perl Docker Kubernetes Prometheus Grafana OpenTelemetry Terraform AWS Azure GCP CI/CD Chaos Engineering Error Budgeting Infrastructure as Code Linux SDLC Relational Databases Hadoop

Collaboration Platforms Lead Engineer

Goldman Sachs

Dallas, TX 1 day ago
Microsoft 365 Teams SharePoint Online OneDrive Exchange Online Power Platform Microsoft Copilot Microsoft Graph Purview DLP sensitivity labels information barriers PowerShell CI/CD Azure GitHub