Lead Principal Engineer, Enterprise Agentic AI Platform

Nvidia

Actively hiring
Santa Clara, US Posted 92 days ago $272,000–$431,250 / year

At a glance

AI generated

TL;DR

NVIDIA’s Enterprise AI & Automation team seeks a Principal or Distinguished Engineer to architect and build enterprise-grade agentic AI systems using Python and/or Go, focusing on Kubernetes deployment, agent runtimes, memory systems, orchestration, and evaluation pipelines. This role involves defining the architecture through practical implementations and reference systems, developing multi-agent orchestration patterns with frameworks like LangChain, and embedding security features directly into agent runtimes. The ideal candidate has over 15 years of experience in large-scale distributed systems, expertise in Kubernetes and GPU-based inference systems, and a track record of transitioning ideas to robust solutions. They must also have deep knowledge of telemetry, benchmarking, and observability systems, as well as experience with enterprise vector databases and agentic search platforms.

Skills

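Python, Go, Kubernetes, LangChain, LangGraph, Terraform, CI/CD, Prometheus, Grafana, PostgreSQL, Redis, Docker, GitLab, AWS, Azure, NVIDIA hardware, GPU acceleration, containerized workloads, network APIs, secure enterprise integration patterns, benchmarking, regression testing, telemetry systems, observability systems, performance tuning, hybrid environments, vector databases, retrieval systems, Glean, Microsoft Copilot Studio, Google Agentspace, LangChain framework, multi-agent management, continuous integration/continuous deployment, SDK development, API design, reference implementations, enterprise AI ecosystems, GPU inference system optimization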

What you'll do

  • Develop production-quality agentic AI systems using Python or Go, covering Kubernetes deployment and agent runtimes.
  • Define and advance NVIDIA’s Enterprise Agentic AI architecture through practical implementations and reference systems.
  • Build multi-agent orchestration patterns with strong regression coverage and observability in frameworks like LangChain.
  • Run fast POCs on emerging agent architectures and harden successful patterns into reusable platform services.
  • Architect data flywheels for continuous improvement of agent quality through telemetry, benchmarking, and feedback loops.

What we're looking for

  • Extensive experience (15+ years) in building large-scale distributed systems with hands-on coding in Python and/or Go.
  • Proven ability to rapidly prototype and scale agentic AI systems from concept to production.
  • Expertise in Kubernetes, containerized workloads, networking, APIs, and secure enterprise integration patterns.
  • Comprehensive knowledge of performance tuning in hybrid environments, including GPU-based inference systems.
  • Experience crafting benchmarking, regression testing, telemetry, and observability systems for agent quality measurement.
  • Strong collaboration skills with the ability to influence cross-functional teams and communicate complex architectural concepts effectively.

Market check

Salary context

This $272,000–$431,250 range sits above 98% of similar postings on FindRole.

Peer median band

$172,100–$259,425

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$175,350–$246,150

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads.

Industry: Semiconductors & AI Computing

Nvidia currently has 801 open roles on FindRole.

Listed pay typically runs $184,000–$287,500 across 797 roles with salary data.

More like this

Similar roles

Principal AI Engineer (Agentic AI)

Humana

Waterside Bldg, US 32 days ago $172,200–$236,900
Python, FastAPI, Flask, Kubernetes, Docker, GCP, AWS, Azure, CI/CD, REST, gRPC, Terraform, Prometheus, Git, PyTorch, TensorFlow, LangChain, LlamaIndex, PydanticAI

Principal Engineer, Agentic AI

PayPal

USA - California - San Jose - Corp - N First St, US 72 days ago $242,000–$359,150
AI, LLMs, Machine Learning, Reinforcement Learning, Data Privacy, Security, Ethical AI, Personalization Engines, Automation Tools, Conversational AI, Voice Commerce, Autonomous Shopping Systems, Fintech, Blockchain, Programmatic Commerce, Regulatory Compliance, CI/CD, Python, Java, JavaScript, SQL, NoSQL, AWS, Azure, Google Cloud, Kubernetes, Docker, Terraform, PostgreSQL, MongoDB

Senior Staff Agentic AI Engineer

Intuit

San Diego, California, US 27 days ago $220,500–$298,500
Python, LLMs, RAG, Conversational Interfaces, MCP, CI/CD, Kubernetes, Docker, AWS, PostgreSQL, Java, J2EE, Prometheus, Grafana, Git, GitHub, Swagger, RESTful APIs, Microservices

Principal AI Engineer

Salesforce

Remote (New York - New York, US) 11 days ago $218,400–$365,200
Salesforce, Distributed Systems, CI/CD, Infrastructure-as-Code, API Integration, AI Agents, LLM Workflows, Automated Testing, Observability, Event-Driven Design, Microservices, Security & Compliance, Prompt Engineering, System Context Design, Evaluation Frameworks, GitHub Copilot, Claude Code, Cursor, Salesforce Marketing Cloud, Agentforce, Google Workspace, Slack

Principal AI Engineer

Humana

Louisville, KY, US 8 days ago $206,600–$284,300
React, TypeScript, .NET, Java, Python, Azure, GCP, Kubernetes, Postgres, LLMs, VectorDBs, CI/CD, DevOps, ETL, Docker, Terraform

Lead AI Engineer

Equifax

USA - Georgia - Alpharetta - 30005, US 49 days ago
Google Cloud Platform, LangChain, LangGraph, Python, Kubernetes, Terraform, GitHub Actions, Jenkins, PostgreSQL, MongoDB, DynamoDB, Firestore, CI/CD, Langfuse, React, Vue, Angular, Gemini, ChatGPT, Claude, GitHub Copilot