Senior Principal Network Development Engineer (IC5) – Backend NIC Qualification & NPI (OCI AI2 – Performance & NIC Engineering)

Oracle

Actively hiring
Austin, TX · Nashville, TN · Posted 45 days ago · $96,800–$251,600 / year

At a glance

AI generated

TL;DR

We are seeking a Senior Principal Network Development Engineer (IC5) to lead backend NIC qualification and New Product Introduction (NPI) for next-generation networking platforms supporting GPU- and accelerator-based clusters in OCI’s AI infrastructure. This role involves driving cross-functional initiatives, owning complex problem spaces end-to-end, and collaborating with hardware vendors and internal teams to ensure NIC technologies meet stringent performance requirements. Key responsibilities include defining validation methodologies, leading deep performance characterization, building automated frameworks for continuous qualification, and using production telemetry to inform validation strategy. The ideal candidate has extensive experience in networking, systems engineering, and Linux kernel-level debugging, along with proficiency in Python or Bash scripting. Expertise in RDMA/RoCE, congestion control, SR-IOV, and distributed AI/ML workloads is essential, as the role affects OCI’s AI infrastructure at a fleet level.

Skills

NIC RDMA RoCE Linux Python Bash CI/CD InfiniBand NCCL PCIe NUMA SmartNICs DPUs Kubernetes Docker Prometheus Grafana

What you'll do

  • Own end-to-end qualification strategy for backend NICs supporting OCI AI clusters.
  • Lead NIC NPI from early silicon bring-up to fleet-wide deployment across regions.
  • Define validation methodologies for high-performance distributed training workloads.
  • Drive performance characterization and tuning of NICs in AI cluster environments.
  • Partner with vendors to resolve complex hardware/firmware issues and influence design.

What we're looking for

  • Bachelor’s or Master’s degree in Computer Science, Electrical Engineering, or related field
  • 8–12+ years of experience in networking, systems engineering, or hardware validation in large-scale distributed environments
  • Deep expertise in NIC architecture and advanced features (RDMA/RoCE, congestion control, SR-IOV)
  • Strong understanding of Linux networking stack and kernel-level debugging
  • Proven experience leading hardware qualification and NPI efforts in data center or cloud environments
  • Strong debugging skills across hardware, firmware, driver, and system layers

Market check

Salary context

Competitive pay

How this pay compares to similar roles

[Pay comparison chart: scale from $78k to $270k; this role $174k, similar roles $182k; shaded band marks where most similar roles pay]

This role pays less than 55% of similar roles. Most pay $146,425–$217,725 — the shaded band above. At the midpoint, this role pays about $174k versus about $182k for comparable roles.

Based on 240 similar postings.

Employer

About Oracle

Oracle Corporation is a leading multinational technology company specializing in database software, cloud computing, and enterprise software.

Oracle currently has 343 open roles on FindRole.

Listed pay typically runs $97,500–$199,500 across 253 roles with salary data.

More like this

Similar roles

Senior Software Engineer, AI Networking

Nvidia

Austin, TX · 72 days ago · $184,000–$287,500
C C++ RDMA verbs DPDK DOCA NCCL CUDA InfiniBand RoCE Docker Kubernetes AWS CI/CD Prometheus Grafana Python PostgreSQL

Senior Software Engineer, AI Networking

Nvidia

Santa Clara, CA · 19 days ago · $152,000–$241,500
Python PyTorch TensorFlow JAX CUDA NCCL Reinforcement_Learning Bayesian_Optimization GNNs Docker Kubernetes CI/CD Prometheus Grafana Bash C++ PostgreSQL Redis

Senior Network Automation Engineer, Full-Stack

The Federal Reserve

Richmond, VA · 36 days ago · $94,600–$130,020
AWS Terraform NextJS JavaScript Java Python MuleSoft Angular Red Hat OpenShift CI/CD GitHub GitLab Maven Jenkins ServiceNow CloudFormation