AI Systems Performance Engineer

Broadcom

Quick summary

Work type: On-site
Location: San Jose, CA
Salary: $141,300–$226,000 / yr
Posted: 47 days ago
Closes: Oct 17, 2026

Market check

Salary context

Competitive pay

How this pay compares to similar roles

[Chart: pay scale from $129k to $259k. Similar roles midpoint: $204k. This role midpoint: $184k. A shaded band marks where most similar roles pay.]

This role pays less than 65% of similar roles do. Most pay $162,562–$246,150, the shaded band in the chart above. At the midpoint, this role pays about $184k versus about $204k for comparable roles.

Based on 240 similar postings.

Employer

About Broadcom

Broadcom is a global semiconductor and infrastructure software company that designs and markets a wide range of networking, storage, and wireless connectivity solutions.

Industry: Semiconductors & Infrastructure Software

Broadcom currently has 105 open roles on FindRole.

Listed pay typically runs $120,000–$192,000 across 103 roles with salary data.


At a glance

TL;DR · AI Systems Performance Engineer

As a Senior AI Fabric Performance Engineer in our Performance Lab, you will benchmark AI inference, training, and storage workloads and optimize the Ethernet fabric that carries data for distributed AI workloads. Day to day, you will run industry-standard benchmarks such as MLPerf and the NCCL tests, tune network parameters, isolate system bottlenecks, develop automation tools, and work with cross-functional teams to document methodologies and provide actionable recommendations. Ideal candidates have deep Linux expertise, proficiency in Python and C++, knowledge of PyTorch, and experience with Ethernet switch performance testing. Familiarity with RDMA, RoCEv2, CI/CD pipelines, and containerization tools is preferred.

What you'll do

  • Install and configure industry-standard AI performance benchmarks, focusing on MLPerf and the NCCL tests.
  • Optimize network parameters to enhance Ethernet fabric performance for distributed AI workloads.
  • Identify and troubleshoot complex system bottlenecks across the Linux OS, server hardware, and switches.
  • Develop automation tools and frameworks to streamline continuous benchmarking processes.
  • Generate detailed reports to aid customer deployment and marketing teams in product positioning.

What we're looking for

  • Bachelor's/Master's degree in a relevant technical field plus extensive industry experience.
  • Deep Linux OS expertise for system-level tuning and troubleshooting.
  • Proficiency in programming and scripting with Python and C++.
  • Knowledge of modern machine learning frameworks like PyTorch.
  • Experience testing the performance of Ethernet switch systems and validating them.
  • Strong analytical skills and proficiency with benchmarking tools.
  • Ability to diagnose root causes in complex, distributed AI systems.

More like this

Similar roles

Applied AI Engineer

Ramp

Remote (New York City, New York, US) · 145 days ago · $155,000–$339,500
Python JavaScript Node.js Django Flask React PostgreSQL MongoDB AWS GCP Kubernetes Terraform CI/CD GitOps

Applied AI Engineer

Booz Allen Hamilton

Fort Belvoir, VA · 22 days ago · $99,000–$225,000
Python FastAPI Flask Streamlit Gradio React TypeScript Kubernetes CI/CD Prometheus Grafana MLOps Docker PostgreSQL AWS Azure Google Cloud Platform

Applied AI Engineer

Apple Inc

Cupertino, CA · 24 days ago · $181,100–$272,100
Python FastAPI LangChain LLMs GenAI RESTful APIs Vector databases Async programming Pipeline orchestration Prometheus OpenTelemetry Redis RabbitMQ Kafka Docker CI/CD

AI Integration Engineer

Booz Allen Hamilton

Annapolis Junction, MD · 39 days ago · $112,800–$257,000
AWS Azure Google Cloud SageMaker GCP AI Platform Azure Machine Learning MLflow Kubeflow TensorFlow Serving Vertex AI LangSmith Arize Phoenix LangGraph CrewAI AutoGen Redis Postgres Kubernetes Docker Terraform CUDA NCCL TensorRT NGINX Prometheus Grafana PyTorch Hugging Face Transformers Ray Horovod Dask

Distinguished AI Engineer

Capital One Financial

Cambridge, MA · 59 days ago · $269,100–$307,200
Python Go Scala Java Cloud Platforms AI Systems ML Algorithms Scalable Solutions Complex AI Systems Responsible AI CI/CD