Lead AI Engineer (FM Hosting, LLM Inference)

Capital One Financial

Actively hiring
New York, NY, US Posted 113 days ago $197,300–$225,100 / year

At a glance

AI generated

TL;DR

As a Lead AI Engineer in FM Hosting at Capital One Financial, you will join the LLM Inference team to lead the development and optimization of large language model inference services. Your primary responsibilities include designing scalable solutions for real-time text generation, enhancing model performance through fine-tuning and optimization techniques, and collaborating with cross-functional teams to integrate these models into various hosting products. You should have extensive experience in machine learning frameworks such as TensorFlow or PyTorch, proficiency in Python, and a strong understanding of cloud computing platforms like AWS or Azure. Additionally, you must possess expertise in natural language processing and be familiar with the challenges associated with deploying large-scale AI systems in production environments.

Skills

Python TensorFlow PyTorch Kubernetes Docker AWS CI/CD PostgreSQL Redis Git Jenkins Prometheus Grafana

What you'll do

  • Design and optimize large language model (LLM) inference systems.
  • Implement efficient hosting solutions for AI models in production.
  • Ensure high performance and scalability of AI services.
  • Monitor and troubleshoot issues in AI deployment environments.
  • Stay updated with the latest advancements in AI technology.

What we're looking for

  • Extensive experience in AI and machine learning technologies.
  • Proficiency in developing and deploying large language models (LLMs).
  • Strong background in technical leadership within a software engineering team.
  • Expertise in cloud hosting solutions for AI applications.
  • Experience with inference operations of LLMs required.

Market check

Salary context

This $197,300–$225,100 range sits above 51% of similar postings on FindRole.

Peer median band

$170,400–$260,000

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$182,212–$246,150

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Capital One Financial

Capital One Financial is a bank holding company specializing in credit cards, auto loans, banking, and savings products, known for its data-driven approach to consumer and commercial finance. Industry: Financial Services & Banking

Capital One Financial currently has 489 open roles on FindRole.

Listed pay typically runs $197,300–$225,100 across 483 roles with salary data.

More like this

Similar roles

Lead AI Engineer (FM Hosting, LLM Inference)

Capital One Financial

New York, NY, US 125 days ago $197,300–$225,100
Python TensorFlow PyTorch Kubernetes Docker AWS CI/CD PostgreSQL Redis Prometheus Grafana GitLab Jenkins

Lead AI Engineer (FM Hosting, LLM Inference)

Capital One Financial

New York, NY, US 16 days ago $197,300–$225,100
Python TensorFlow PyTorch Kubernetes Docker AWS CI/CD PostgreSQL Redis Git Jenkins Prometheus Grafana

Senior Lead AI Engineer (FM Hosting, LLM Inference)

Capital One Financial

McLean, VA, US 122 days ago $229,900–$262,400
Python TensorFlow PyTorch Kubernetes Docker AWS CI/CD PostgreSQL Redis Prometheus Grafana GitLab Jupyter Notebook Scikit-learn Pandas NumPy Hugging Face Transformers

Senior Lead AI Engineer (LLM Gateway, FM Hosting)

Capital One Financial

McLean, VA, US 15 days ago $229,900–$262,400
Python TensorFlow PyTorch Docker Kubernetes AWS CI/CD Git PostgreSQL Redis Scikit-learn Flask RESTful APIs Nginx Prometheus Grafana