Director, AI Alignment and Interpretability (Remote)

CrowdStrike

Remote

Quick summary

Work type
Remote
Location
Remote
Salary
$195,000–$290,000 / yr
Posted
5 days ago

Market check

Salary context

Competitive pay

How this pay compares to similar roles

Similar $229k
This role $242k
$184k most similar roles pay here $301k

This role pays more than 60% of similar roles. Most pay $209,562–$247,975 — the shaded band above. At the midpoint, this role pays about $242k versus about $229k for comparable roles.

Based on 240 similar postings.

Employer

About CrowdStrike

CrowdStrike is a leading American cybersecurity technology firm, specializing in cloud-native endpoint protection, threat intelligence, and incident response.

CrowdStrike currently has 27 open roles on FindRole.

Listed pay typically runs $125,000–$180,000 across 27 roles with salary data.

Most-posted roles

View all roles at CrowdStrike

At a glance

TL;DR · Director, AI Alignment and Interpretability (Remote)

As a senior research scientist in CrowdStrike's security-domain AI team, you will lead the alignment and interpretability research for specialized AI systems, focusing on understanding how these models represent threat concepts and detect misuse. Your daily work involves developing methods to probe model internals, identifying latent representations of vulnerability knowledge, and translating findings into actionable training interventions and evaluation protocols. You must have a deep background in mechanistic interpretability and experience with large language models, along with hands-on expertise in techniques like probing classifiers and circuit analysis. The role requires familiarity with offensive security and adversarial ML to ensure responsible deployment of AI systems that understand and reason about offensive techniques.

What you'll do

  • Own the alignment and interpretability research agenda for security-domain AI systems.
  • Develop methods to detect misuse signals in model internals by probing latent representations and analyzing activations.
  • Create evaluation frameworks with behavioral constraints and training interventions to ensure models operate within intended bounds.
  • Publish original research on interpretability for security-specialized models to advance the field.
  • Recruit, develop, and retain a lean team of research scientists while remaining an active technical contributor.

What we're looking for

  • MS or PhD in machine learning, computer science, or related field with research depth in interpretability.
  • 8+ years of experience in ML research or engineering, including direct work on large language model alignment.
  • Hands-on expertise with mechanistic interpretability methods applied to real models.
  • Experience designing and running rigorous alignment evaluations for AI safety claims.
  • Track record of leading and growing researchers while remaining an active technical contributor.
  • Background in offensive security, vulnerability research, or adversarial ML to understand misuse potential.

More like this

Similar roles

Director, Applied AI

Pfizer

NY +1 11 days ago $176,600$294,300
Python Java AI/ML Cloud Engineering Data Engineering CI/CD DevOps Kubernetes Terraform AWS Azure PostgreSQL MongoDB Observability Monitoring LLMs Retrieval Architectures Generative AI Claude Code Claude Enterprise MCP Development Claude Skill Development
Hybrid

Director, AI Enablement

Micron Technology

Boise, ID 56 days ago
CI/CD Azure OpenAI Microsoft 365 Copilot Vector databases MLOps Security automation tools Governance as Code Kubernetes Docker Python PostgreSQL AWS Grafana Prometheus

Director, AI Enablement & Ecosystem

Blackrock

New York 41 days ago $215,000$275,000
DevSecOps AI Python Java Cloud-Native Kubernetes Docker CI/CD Terraform PostgreSQL AWS GCP Azure Prometheus Grafana GitLab Jenkins Swagger GraphQL
Hybrid

Director, AI Platforms

SoFi

San Francisco, CA +1 60 days ago $198,400$341,000
AWS Kubernetes CI/CD Infrastructure as Code Policy-as-Code Observability Metrics Logging Tracing PostgreSQL Python Docker Terraform OpenCLAW

Director and Group Head, Applied AI

Novartis

Cambridge 14 days ago $194,600$361,400
Python TensorFlow Keras PyTorch Scikit-learn MLOps Docker Git AWS Azure CI/CD PostgreSQL MongoDB Spark Hadoop Jupyter GitHub Slack Zoom
Hybrid