Principal TPM - AI Infrastructure

Oracle

Quick summary

Work type: On-site
Location: Seattle, WA · Santa Clara, CA
Posted: 15 days ago

Market check

Salary context

How this pay compares to similar roles

[Chart: similar roles marked at about $213k, on a scale from $164k to $267k]

This listing doesn't post a salary. Most similar roles pay $179,937–$246,150.

Based on 239 similar postings.

Employer

About Oracle

Oracle Corporation is a leading multinational technology company specializing in database software, cloud computing, and enterprise software.

Oracle currently has 467 open roles on FindRole.

Listed pay typically runs $97,500–$209,500 across 353 roles with salary data.

At a glance

TL;DR · Principal TPM - AI Infrastructure

As a Principal Technical Program Manager on the AI Infrastructure GPU Operations Team in Seattle, you will lead cross-functional initiatives that connect engineering, operations, finance, and senior leadership to drive deployment planning and operational readiness for OCI’s expanding GPU infrastructure. Day to day, you will manage regional deployment readiness, track fleet health across NVIDIA and AMD platforms, and coordinate incident governance and risk management processes. You will also help the organization scale by improving dashboards and documentation and by promoting practical use of AI to raise operations productivity. The role calls for program discipline, business analytics skills, and clear communication to ensure disciplined execution and measurable reliability outcomes in a high-visibility environment. Ideal candidates have 6+ years of experience in technical program management with strong backgrounds in infrastructure operations, data analysis, and cross-functional leadership.

What you'll do

  • Drive availability and reliability of large-scale GPU fleets by identifying systemic issues and leading recovery efforts.
  • Own end-to-end execution of critical AI Infrastructure GPU Operations programs to ensure alignment with business priorities.
  • Manage deployment governance, change review processes, and incident management mechanisms for high-volume activities.
  • Build and maintain executive-level reporting including monthly business reviews and weekly operational KPIs.
  • Improve operations productivity by driving practical use of AI and automation in GPU operations workflows.

What we're looking for

  • 5+ years of experience in technical program management or a related field.
  • Proven ability to lead complex cross-functional initiatives with measurable outcomes.
  • Strong operational background, including cadence building, governance, and KPI reporting.
  • Advanced Excel skills for data modeling, financial analysis, and operational insights.
  • Experience developing dashboards and automated reports for business visibility.
  • Knowledge of cloud infrastructure, AI/ML operations, and GPU fleet management (preferred).
  • Excellent written and verbal communication skills for executive updates and decisions.

More like this

Similar roles

OCI & AI Infrastructure Pursuits Lead

Oracle

New York, NY · 21 days ago · $97,500–$199,500
Oracle Cloud Infrastructure AWS Azure GCP AI/ML HPC CI/CD Python PostgreSQL Kubernetes Docker Terraform Prometheus Grafana

TPM - AI Ready Data Solutions

Johnson & Johnson

Remote (Ireland) · 9 days ago · $137,000–$235,750
AI Data Governance Information Architecture Knowledge Graph Semantic Modelling Ontology Data Quality Metadata Management CI/CD Python R SQL Kafka Hadoop Spark PostgreSQL AWS Azure GCP Docker Kubernetes Terraform Git Jenkins Prometheus Grafana

Principal Technical Program Manager - AI Infrastructure

Microsoft

Redmond, WA · today · $142,800–$274,800
Azure Kubernetes Docker CI/CD Python PostgreSQL Prometheus Grafana AWS Terraform Git Linux REST JSON/WebAPI Scalability Security Reliability Performance Optimization AI Workloads Training Inference

AI Governance Technology Lead

Global Payments (TSYS)

Alpharetta, GA · 66 days ago
Python SQL AI Governance Platforms Risk Assessment Tools Synthetic Data Generation Automated Testing Frameworks Anomaly Detection AI Observability Tools Bias Audits Adversarial Testing Stress Testing Statistical Testing XAI Techniques CI/CD PostgreSQL MLOps EU AI Act GDPR CCPA NIST AI RMF

AI Governance Technology Lead

Global Payments (TSYS)

Alpharetta, GA · 66 days ago
Python SQL AI Governance Platforms Risk Assessment Tools Synthetic Data Generation Automated Testing Frameworks Anomaly Detection AI Observability Tools CI/CD EU AI Act GDPR CCPA NIST AI RMF Bias Audits Adversarial Testing Stress Testing Explainable AI (XAI) Model Validation Techniques Monitoring and Logging Frameworks