Lead AI Engineer (FM Hosting, LLM Inference)

Capital One Financial

Actively hiring
New York, Ny, US Posted 125 days ago $197,300$225,100 / year

At a glance

AI generated

TL;DR

As a Lead AI Engineer at FM Hosting, you will join a dynamic team focused on deploying and optimizing large language model (LLM) inference services. Your primary responsibilities include designing scalable infrastructure to support real-time LLM interactions, ensuring high performance and reliability. You will work closely with data scientists and software engineers to integrate cutting-edge machine learning techniques into our platform, enhancing user experience through advanced natural language processing capabilities. Ideal candidates possess extensive hands-on experience with cloud computing platforms like AWS or Azure, proficiency in Python and other relevant programming languages, and a strong background in AI and ML technologies. This role demands expertise in managing complex data pipelines, containerization tools such as Docker and Kubernetes, and an understanding of the unique challenges associated with large-scale deployment of LLMs.

Skills

Python TensorFlow PyTorch Kubernetes Docker AWS CI/CD PostgreSQL Redis Prometheus Grafana GitLab Jenkins

What you'll do

  • Design and optimize large language model (LLM) inference systems.
  • Implement efficient hosting solutions for AI models in FM environment.
  • Develop scalable infrastructure to support real-time LLM processing.
  • Ensure high performance and low latency in AI service delivery.
  • Monitor and enhance the security of AI model deployments.

What we're looking for

  • Extensive experience in AI and machine learning technologies.
  • Proficient in developing and deploying large language models (LLMs).
  • Strong background in software engineering and cloud computing platforms.
  • Expertise in natural language processing (NLP) techniques and applications.
  • Experience with full-stack development, including front-end and back-end systems.

Market check

Salary context

This $197,300–$225,100 range sits above 51% of similar postings on FindRole.

Peer median band

$170,400$260,000

Median floor and ceiling across peers.

Typical midpoint (25–75%)

$182,212$246,150

Middle half of comparable postings.

Based on 240 comparable postings.

* 240 is the maximum number of comparable postings sampled.

Employer

About Capital One Financial

Capital One Financial is a bank holding company specializing in credit cards, auto loans, banking, and savings products, known for its data-driven approach to consumer and commercial finance. Industry: Financial Services & Banking

Capital One Financial currently has 489 open roles on FindRole.

Listed pay typically runs $197,300–$225,100 across 483 roles with salary data.

Most-posted roles

View all roles at Capital One Financial

More like this

Similar roles

Lead AI Engineer (FM Hosting, LLM Inference)

Capital One Financial

New York, Ny, US 113 days ago $197,300$225,100
Python TensorFlow PyTorch Kubernetes Docker AWS CI/CD PostgreSQL Redis Git Jenkins Prometheus Grafana

Lead AI Engineer (FM Hosting, LLM Inference)

Capital One Financial

New York, Ny, US 16 days ago $197,300$225,100
Python TensorFlow PyTorch Kubernetes Docker AWS CI/CD PostgreSQL Redis Git Jenkins Prometheus Grafana

Senior Lead AI Engineer (FM Hosting, LLM Inference)

Capital One Financial

Mclean, Va, US 122 days ago $229,900$262,400
Python TensorFlow PyTorch Kubernetes Docker AWS CI/CD PostgreSQL Redis Prometheus Grafana GitLab Jupyter Notebook Scikit-learn Pandas NumPy Hugging Face Transformers

Senior Lead AI Engineer (LLM Gateway, FM Hosting)

Capital One Financial

Mclean, Va, US 15 days ago $229,900$262,400
Python TensorFlow PyTorch Docker Kubernetes AWS CI/CD Git PostgreSQL Redis Scikit-learn Flask RESTful APIs Nginx Prometheus Grafana