Lead AI Engineer (FM Hosting, LLM Inference)

Capital One Financial

Actively hiring
New York, NY · McLean, VA · San Jose, CA Posted 17 days ago $197,300$225,100 / year

At a glance

AI generated

TL;DR

As a Lead AI Engineer at FM Hosting, you will lead the team responsible for deploying and optimizing large language model (LLM) inference services. Your primary focus will be on developing scalable solutions to enhance performance and efficiency of LLMs in real-time applications. You will work closely with data scientists and software engineers to integrate advanced machine learning models into existing infrastructure, ensuring seamless user experiences. The ideal candidate possesses extensive experience with cloud platforms like AWS or Azure, proficiency in Python for model deployment, and expertise in Kubernetes for container orchestration. Additionally, familiarity with natural language processing (NLP) techniques and a strong understanding of distributed systems are crucial to tackle the challenges of scaling AI services at high volume.

Skills

Python TensorFlow PyTorch Kubernetes Docker AWS CI/CD PostgreSQL Redis Git Jenkins Prometheus Grafana

What you'll do

  • Design and optimize large language model (LLM) inference systems.
  • Implement efficient hosting solutions for AI models in production environments.
  • Develop scalable infrastructure to support real-time LLM processing.
  • Ensure high performance and low latency in AI service delivery.
  • Monitor and maintain the security of AI systems and data.

What we're looking for

  • Extensive experience in large language model (LLM) inference and deployment.
  • Proficient in hosting and managing AI services at scale.
  • Strong background in technical leadership within AI engineering teams.
  • Expertise in cloud computing platforms like AWS, Azure, or GCP.
  • Experience with machine learning frameworks such as TensorFlow or PyTorch.
  • Deep understanding of natural language processing (NLP) techniques.
  • Solid track record in delivering high-performance AI solutions.

Market check

Salary context

This $197,300–$225,100 range sits above 51% of similar postings on FindRole.

Peer median band

$170,400$260,000

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$182,212$246,150

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Capital One Financial

Capital One Financial is a bank holding company specializing in credit cards, auto loans, banking, and savings products, known for its data-driven approach to consumer and commercial finance. Industry: Financial Services & Banking

Capital One Financial currently has 489 open roles on FindRole.

Listed pay typically runs $197,300–$225,100 across 483 roles with salary data.

Most-posted roles

View all roles at Capital One Financial

More like this

Similar roles

Lead AI Engineer (FM Hosting, LLM Inference)

Capital One Financial

New York, Ny, US 126 days ago $197,300$225,100
Python TensorFlow PyTorch Kubernetes Docker AWS CI/CD PostgreSQL Redis Prometheus Grafana GitLab Jenkins

Lead AI Engineer (FM Hosting, LLM Inference)

Capital One Financial

New York, Ny, US 114 days ago $197,300$225,100
Python TensorFlow PyTorch Kubernetes Docker AWS CI/CD PostgreSQL Redis Git Jenkins Prometheus Grafana

Senior Lead AI Engineer (FM Hosting, LLM Inference)

Capital One Financial

Mclean, Va, US 123 days ago $229,900$262,400
Python TensorFlow PyTorch Kubernetes Docker AWS CI/CD PostgreSQL Redis Prometheus Grafana GitLab Jupyter Notebook Scikit-learn Pandas NumPy Hugging Face Transformers

Senior Lead AI Engineer (LLM Gateway, FM Hosting)

Capital One Financial

Mclean, Va, US 16 days ago $229,900$262,400
Python TensorFlow PyTorch Docker Kubernetes AWS CI/CD Git PostgreSQL Redis Scikit-learn Flask RESTful APIs Nginx Prometheus Grafana