Principal Engineer - AI Networking

Oracle

Quick summary

Work type
On-site
Location
Austin, TXSeattle, WA
Salary
$99,600–$234,600 / yr
Posted
5 days ago

Market check

Salary context

Below market

How this pay compares to similar roles

Similar $209k
This role $167k
$81k most similar roles pay here $274k

This role pays less than 76% of similar roles. Most pay $172,375–$246,150 — the shaded band above. At the midpoint, this role pays about $167k versus about $209k for comparable roles.

Based on 240 similar postings.

Employer

About Oracle

Oracle Corporation is a leading multinational technology company specializing in database software, cloud computing, and enterprise software.

Oracle currently has 755 open roles on FindRole.

Listed pay typically runs $97,500–$209,500 across 568 roles with salary data.

Most-posted roles

View all roles at Oracle

At a glance

TL;DR · Principal Engineer - AI Networking

As a Principal Engineer in the AI Networking team, you will leverage your deep expertise in RDMA and high-performance networking to design, implement, and optimize critical infrastructure for large-scale AI workloads. Your day-to-day responsibilities include building robust collective communication frameworks, enhancing transport layers, and developing advanced congestion management and resiliency features. You will collaborate closely with cross-functional teams to deliver scalable distributed systems that support both training and inference environments, ensuring optimal performance across networking, GPU, and software stacks. Essential skills for this role include proficiency in C/C++ and Linux systems programming, extensive experience with RDMA technologies like RoCEv2 and InfiniBand, and a strong background in diagnosing and resolving complex performance issues. Additionally, familiarity with AI/ML infrastructure, distributed training frameworks, and cloud platforms is highly desirable as you contribute to architectural design discussions and help maintain engineering excellence within the team.

What you'll do

  • Design, develop, and optimize RDMA-based software components for large-scale AI infrastructure.
  • Build collective communication frameworks and transport layers for distributed AI workloads.
  • Develop congestion management and load balancing capabilities for RDMA networks.
  • Analyze and improve performance across networking, GPU, and software stacks in production.
  • Investigate and resolve complex networking issues affecting AI training and inference environments.
  • Contribute to architectural design discussions for networking platforms supporting AI systems.

What we're looking for

  • 7+ years of software engineering experience in systems software, networking, or distributed systems
  • Strong hands-on expertise with RDMA technologies like RoCEv2 and InfiniBand
  • Experience developing RDMA-enabled software and communication libraries
  • Proficiency in C/C++ and Linux systems programming
  • Solid understanding of networking fundamentals, operating systems, and distributed systems concepts
  • Ability to diagnose and solve complex performance and scalability problems
  • Strong collaboration and communication skills in cross-functional engineering environments

More like this

Similar roles

Senior Principal Engineer - AI Networking

Oracle

Austin, TX +1 5 days ago $96,800$306,400
RDMA InfiniBand C/C++ Linux NCCL RCCL MPI UCX XCCL PyTorch DeepSpeed Megatron-LM TensorFlow JAX Kubernetes GPU networking GPUDirect RDMA RoCE congestion management adaptive routing traffic shaping network resiliency Docker CI/CD

Senior Principal Engineer - AI Networking

Oracle

Seattle, WA 5 days ago $96,800$306,400
RDMA InfiniBand C/C++ Linux NCCL RCCL MPI UCX XCCL PyTorch DeepSpeed Megatron-LM TensorFlow JAX Kubernetes GPU GPUDirect RHEL Networking Distributed Systems

Principal Network Engineer - AI Infrastructure

CVS Health

Remote (Work At Home-New York, US) 5 days ago $144,200$288,400
Cisco NVIDIA Palo_Alto_Networks BGP OSPF MPLS STP SD_WAN NetFlow Wireshark SolarWinds EVPN VXLAN RDMA_over_Converged_Ethernet F5_Load_Balancing ACI CCIE CISSP AWS Azure GCP
Remote

Principal AI Engineer

Salesforce

New York +4 19 days ago $218,400$365,200
Salesforce Distributed Systems CI/CD Infrastructure-as-Code API Integration AI Agents LLM Workflows Automated Testing Observability Event-Driven Design Microservices Security & Compliance Prompt Engineering System Context Design Evaluation Frameworks GitHub Copilot Claude Code Cursor Salesforce Marketing Cloud Agentforce Google Workspace Slack

Principal AI Engineer

Humana

DC +4 26 days ago $206,600$284,300
React TypeScript .NET Java Python Azure GCP Kubernetes Postgres LLMs VectorDBs CI/CD DevOps ETL Docker Terraform
Hybrid

Principal AI Engineer

Salesforce

Remote (San Francisco, CA) +4 18 days ago $197,300$313,700
AWS Python GitHub Actions ArgoCD Terraform Docker Kubernetes Grafana Braintrust LangSmith CI/CD AgentOps Salesforce Ecosystem Vector Databases Graph Databases RAG Pipelines Snowflake Kafka Flink
Remote