Principal Machine Learning Engineer

Oracle

Quick summary

Work type
On-site
Location
Austin, TX
Salary
$114,600–$234,600 / yr
Posted
9 days ago

Market check

Salary context

Below market

How this pay compares to similar roles

Similar $227k
This role $175k
$97k most similar roles pay here $283k

This role pays less than 85% of similar roles. Most pay $197,631–$256,000 — the shaded band above. At the midpoint, this role pays about $175k versus about $227k for comparable roles.

Based on 240 similar postings.

Employer

About Oracle

Oracle Corporation is a leading multinational technology company specializing in database software, cloud computing, and enterprise software.

Oracle currently has 607 open roles on FindRole.

Listed pay typically runs $97,500–$209,500 across 467 roles with salary data.

Most-posted roles

View all roles at Oracle

At a glance

TL;DR · Principal Machine Learning Engineer

As a Principal Machine Learning Engineer in Oracle Cloud Infrastructure's AI Infrastructure organization, you will lead the development of advanced training infrastructure for large GPU clusters and design agentic systems at enterprise scale. Your day-to-day responsibilities include designing robust software architectures using Java, Python, and other languages, participating in the full software lifecycle from development to production operations, and contributing to cutting-edge research on Generative AI models. You will work with multi-modal data generation frameworks and collaborate closely with product managers to enhance coding standards and foster an inclusive engineering culture. Essential skills include experience with distributed systems, cloud architecture best practices, and hands-on knowledge of Kubernetes and Docker for building highly available services. This role demands a strong background in system design and the ability to deliver impactful solutions in fast-paced environments.

What you'll do

  • Design and develop AI software using Java, Python, and other languages.
  • Build distributed, scalable, fault-tolerant systems for Generative AI model development.
  • Apply engineering principles to define robust architectures for bleeding-edge GPU clusters.
  • Participate in the full lifecycle of model development from training to evaluation.
  • Contribute to coding standards and enhance inclusive engineering culture within the team.
  • Identify requirements, scope solutions, estimate work, and schedule deliverables effectively.

What we're looking for

  • 6+ years of experience building and shipping enterprise distributed or cloud-native systems.
  • Strong foundation in system design, distributed systems, and cloud architecture best practices.
  • Proficiency in Java, Python, or similar object-oriented languages.
  • Experience scaling heterogeneous CPU/GPU training infrastructure for large multimodal models.
  • Hands-on experience with containers and orchestration technologies like Kubernetes and Docker.
  • Proven ability to deliver impact in collaborative, fast-paced environments.

More like this

Similar roles

Principal Machine Learning Engineer

General Motors (GM)

Remote (Sunnyvale, CA) 84 days ago $296,300$453,200
Python PyTorch Distributed Training AWS GCP Azure GPU Computing C++ Profiling Analysis Debugging Optimization Distributed Systems Cloud Environments
Remote Hybrid

Principal Machine Learning Engineer

Intuit

Mountain View, CA 55 days ago $254,500$344,000
Python TensorFlow PyTorch Java Scala Docker Kubernetes AWS CI/CD MLOps PostgreSQL Redis Git Jenkins Prometheus Grafana

Principal Machine Learning Engineer

PayPal

San Jose, CA 84 days ago $242,000$359,150
Python TensorFlow PyTorch Spark BigQuery Airflow dbt Kubernetes AWS Google Cloud CI/CD Docker Prometheus Grafana Redis PostgreSQL MongoDB GraphQL REST_API Swagger
Hybrid

Principal Machine Learning Engineer

Zillow

Remote (Remote-Usa, US) 27 days ago $204,400$326,600
Python LangGraph LangChain AgentsSDK AutoGen Spark Databricks Airflow Temporal AWS CI/CD LLM-based systems Vector stores Observability Elasticsearch Kubernetes
Remote

Principal Machine Learning Engineer

Cisco

Remote (San Jose, CA) +4 14 days ago $291,500$369,100
Python PyTorch TensorFlow NLP Log Analytics Anomaly Detection Multi-Modal AI Modeling Distributed Training MLOps CI/CD Prometheus Grafana Kubernetes AWS Azure
Remote

Lead Machine Learning Engineer

Capital One Financial

McLean, VA 37 days ago $197,300$225,100
Python TensorFlow Kubernetes AWS Docker CI/CD Git PostgreSQL Scikit-learn Pandas NumPy Jupyter Linux MLOps