Senior Software Engineer – TensorRT Edge-LLM

Nvidia

Actively hiring
Remote (Us, Ca, Santa Clara, US) Posted 72 days ago $152,000$241,500 / year

At a glance

AI generated

TL;DR

Join NVIDIA’s TensorRT Edge-LLM team as a senior software engineer and help develop cutting-edge inference frameworks for large language models running on embedded devices. Your day-to-day will involve extending TensorRT with autoregressive model serving capabilities, implementing compiler and runtime optimizations for transformer-based models, and collaborating across teams to deliver high-performance solutions. You’ll also contribute to CUDA kernel development, benchmark performance in diverse environments, and stay current with emerging LLM/VLM techniques. Ideal candidates have a deep understanding of transformer models, proficiency in modern C++, familiarity with TensorRT and related frameworks, and experience with CUDA for efficient GPU programming. This role requires expertise in low-latency, resource-constrained systems and a track record of strong software design and collaboration.

Skills

C++ TensorRT CUDA vLLM SGLang MLC-LLM FlashInfer Transformer models Quantization Tensor parallelism Memory-efficient scheduling Speculative decoding KV cache management Compiler infrastructure Robotics Embedded AI pipelines Performance profiling GPU architecture

What you'll do

  • Develop and evolve a state-of-the-art inference framework for autoregressive models.
  • Design compiler and runtime optimizations for transformer-based models on constrained platforms.
  • Contribute to CUDA kernel development for critical transformer components like attention and GEMM.
  • Benchmark and optimize inference performance across diverse embedded environments.
  • Stay updated with emerging techniques in the LLM/VLM ecosystem and integrate them into software.

What we're looking for

  • 4+ years of relevant software development experience in AI or related field.
  • Deep understanding of transformer models and inference optimization techniques.
  • Proficient programming ability with modern C++ (C++11/14/17 and beyond).
  • Familiarity with TensorRT, vLLM, SGLang, MLC-LLM, FlashInfer frameworks/libraries.
  • Experience in CUDA kernel development, performance profiling, GPU architecture.
  • Track record of strong software design, execution, and cross-team collaboration.
  • Prior work on autoregressive LLM serving systems including speculative decoding.

Market check

Salary context

This $152,000–$241,500 range sits above 75% of similar postings on FindRole.

Peer median band

$119,800$230,000

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$142,400$197,562

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Nvidia

Nvidia is a leading designer of graphics processing units (GPUs) and system-on-chip units, powering gaming, professional visualization, data centers, and artificial intelligence workloads. Industry: Semiconductors & AI Computing

Nvidia currently has 802 open roles on FindRole.

Listed pay typically runs $184,000–$287,500 across 798 roles with salary data.

Most-posted roles

View all roles at Nvidia

More like this

Similar roles

Senior Software Engineer

The Walt Disney Company

Remote (Usa - Ca - 2450 Broadway, US) 73 days ago $141,900$190,300
Java Kotlin AWS Azure Google Cloud Docker Jenkins Kafka Kinesis SQS Datadog Splunk Grafana CI/CD RESTful services Git Scrum Agile Messaging technologies Observability tools
Remote

Senior Software Engineer

Adobe

Lehi, US 52 days ago $139,000$139,000
Java React AWS GCP Azure AI/ML Python CI/CD Docker Kubernetes PostgreSQL Git GitHub Jenkins Prometheus Grafana DevOps Agile Scrum

Senior Software Engineer

Microsoft

Redmond, Wa,Us, US 114 days ago $119,800$234,700
Python Java Go C++ Docker Kubernetes AWS Azure CI/CD PostgreSQL MongoDB Redis GraphQL OAuth OpenIDConnect ZeroTrustArchitecture

Senior Software Engineer

Broadcom

Usa-Ma-Burlington - Blue Sky, US 86 days ago $108,000$172,800
Java Kubernetes GitHub Maven Jenkins Docker CI/CD Git Linux Python PostgreSQL VMware vSphere vSAN NSX Terraform AWS Azure

Senior Software Engineer

Warner Bros. Discovery

Remote (Ga Atlanta 1050 Techwood Drive Nw, US) 79 days ago
JavaScript Node.js Python AWS Svelte CSS Agile CI/CD Datadog New Relic vue.js handlebars.js
Remote

Senior Software Engineer

Adobe

San Jose, US 73 days ago $228,600$331,050
Apache Spark Hadoop Apache Kafka AWS S3 Azure Data Lake Storage Apache Parquet Databricks Delta Apache Iceberg Apache Hudi Apache HBase Cassandra MongoDB Azure Cosmos DB Java Scala CI/CD Agile