Senior Software Engineer, Generative AI Systems

Nvidia

Actively hiring Posted this week

Santa Clara, CA | Posted 5 days ago | $152,000–$241,500 / year


At a glance

AI generated

TL;DR

NVIDIA is hiring a Senior Software Engineer for its expanding AI and Generative AI engineering team. The role involves designing and developing scalable infrastructure for large-scale ML training and inference, and building distributed systems and cloud-native platforms that support GPU clusters and high-performance workloads. The engineer will also develop evaluation frameworks for LLMs and agentic AI systems, optimize retrieval-augmented generation (RAG) pipelines, and build backend services using FastAPI, Kubernetes, Docker, and modern cloud platforms. Key skills include strong programming in Python and/or C++, experience with machine learning frameworks such as PyTorch or TensorFlow, and familiarity with large language models, RAG systems, and prompt engineering. Ideal candidates have a background in distributed ML training, GPU scheduling, and AI safety systems, and will contribute to full-stack AI applications and production deployment pipelines.

Skills

Python, Kubernetes, Docker, FastAPI, AWS, GCP, Azure, PyTorch, TensorFlow, JAX, DeepSpeed, CI/CD, Prometheus, Grafana, LLMs, RAG, Prompt Engineering, Evaluation Frameworks, Agentic AI, Knowledge Graphs, Vector Databases

What you'll do

Design and develop scalable infrastructure for large-scale ML training and inference.
Build distributed systems supporting GPU clusters and fault-tolerant training.
Develop evaluation frameworks for LLMs and agentic AI systems.
Architect retrieval-augmented generation (RAG) pipelines and knowledge management systems.
Create backend services and APIs using FastAPI, Kubernetes, Docker, and cloud platforms.
Contribute to full-stack AI applications and production deployment pipelines.

What we're looking for

At least 2 years of industry experience in software engineering, AI/ML systems, distributed systems, or Generative AI applications.
Strong programming skills in Python and/or C++ with experience in scalable software systems.
Experience developing distributed systems, cloud infrastructure, backend services, and ML systems infrastructure.
Hands-on experience with machine learning frameworks like PyTorch, TensorFlow, JAX, or DeepSpeed.
Familiarity with Kubernetes, Docker, and modern cloud platforms such as AWS, GCP, or Azure.
Experience building APIs and scalable services using FastAPI, Node.js, TypeScript, or similar technologies.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads.

Industry: Semiconductors & AI Computing

Nvidia currently has 825 open roles on FindRole.

Listed pay typically runs $184,000–$287,500 across 813 roles with salary data.
