Senior Software Engineer, Generative AI Systems

Nvidia

Santa Clara, CA Posted 5 days ago $152,000–$241,500 / year

At a glance

AI generated

TL;DR

NVIDIA is hiring a Senior Software Engineer for its growing AI and Generative AI engineering team. The role covers designing and developing scalable infrastructure for large-scale ML training and inference, and building distributed systems and cloud-native platforms that support GPU clusters and high-performance workloads. The engineer will also develop evaluation frameworks for LLMs and agentic AI systems, optimize retrieval-augmented generation (RAG) pipelines, and build backend services with FastAPI, Kubernetes, Docker, and modern cloud platforms. Key skills include strong programming in Python or C++, experience with machine learning frameworks such as PyTorch or TensorFlow, and familiarity with large language models, RAG systems, and prompt engineering. Ideal candidates have a background in distributed ML training, GPU scheduling, and AI safety systems, and will contribute to full-stack AI applications and production deployment pipelines.

Skills

Python, Kubernetes, Docker, FastAPI, AWS, GCP, Azure, PyTorch, TensorFlow, JAX, DeepSpeed, CI/CD, Prometheus, Grafana, LLMs, RAG, Prompt Engineering, Evaluation Frameworks, Agentic AI, Knowledge Graphs, Vector Databases

What you'll do

  • Design and develop scalable infrastructure for large-scale ML training and inference.
  • Build distributed systems supporting GPU clusters and fault-tolerant training.
  • Develop evaluation frameworks for LLMs and agentic AI systems.
  • Architect retrieval-augmented generation (RAG) pipelines and knowledge management systems.
  • Create backend services and APIs using FastAPI, Kubernetes, Docker, and cloud platforms.
  • Contribute to full-stack AI applications and production deployment pipelines.

What we're looking for

  • 2+ years of industry experience in software engineering, AI/ML systems, distributed systems, or Generative AI applications.
  • Strong programming skills in Python and/or C++ with experience in scalable software systems.
  • Experience developing distributed systems, cloud infrastructure, backend services, and ML systems infrastructure.
  • Hands-on experience with machine learning frameworks like PyTorch, TensorFlow, JAX, or DeepSpeed.
  • Familiarity with Kubernetes, Docker, and modern cloud platforms such as AWS, GCP, or Azure.
  • Experience building APIs and scalable services using FastAPI, Node.js, TypeScript, or similar technologies.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads.

Industry: Semiconductors & AI Computing

Nvidia currently has 825 open roles on FindRole.

Listed pay typically runs $184,000–$287,500 across 813 roles with salary data.
