Senior Software Engineer, Generative AI Systems
At a glance
TL;DR (AI generated)
NVIDIA is hiring a Senior Software Engineer to join its expanding AI and Generative AI engineering team. The role involves designing and developing scalable infrastructure for large-scale ML training and inference, and building distributed systems and cloud-native platforms that support GPU clusters and high-performance workloads. The engineer will also develop evaluation frameworks for LLMs and agentic AI systems, optimize retrieval-augmented generation (RAG) pipelines, and build backend services using FastAPI, Kubernetes, Docker, and modern cloud platforms. Key skills include strong programming in Python or C++, experience with machine learning frameworks such as PyTorch or TensorFlow, and familiarity with large language models, RAG systems, and prompt engineering. Ideal candidates have a background in distributed ML training, GPU scheduling, and AI safety systems, and will contribute to full-stack AI applications and production deployment pipelines.
What you'll do
- Design and develop scalable infrastructure for large-scale ML training and inference.
- Build distributed systems supporting GPU clusters and fault-tolerant training.
- Develop evaluation frameworks for LLMs and agentic AI systems.
- Architect retrieval-augmented generation (RAG) pipelines and knowledge management systems.
- Create backend services and APIs using FastAPI, Kubernetes, Docker, and cloud platforms.
- Contribute to full-stack AI applications and production deployment pipelines.
What we're looking for
- At least 2 years of industry experience in software engineering, AI/ML systems, distributed systems, or Generative AI applications.
- Strong programming skills in Python and/or C++ with experience in scalable software systems.
- Experience developing distributed systems, cloud infrastructure, backend services, and ML systems infrastructure.
- Hands-on experience with machine learning frameworks and libraries such as PyTorch, TensorFlow, JAX, or DeepSpeed.
- Familiarity with Kubernetes, Docker, and modern cloud platforms such as AWS, GCP, or Azure.
- Experience building APIs and scalable services using FastAPI, Node.js, TypeScript, or similar technologies.
Employer
About Nvidia
Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads.
Industry: Semiconductors & AI Computing
Nvidia currently has 825 open roles on FindRole.
Listed pay typically runs $184,000–$287,500 across 813 roles with salary data.
Most-posted roles
- Senior Solutions Architect, AI Infrastructure (4)
- Senior System Software Engineer - AV Platform (4)
- Senior Circuit Design Engineer (3)
- Senior Circuit Methodology Engineer (3)
- Senior Deep Learning Performance Architect (3)