Director, AI Alignment and Interpretability (Remote)

CrowdStrike

Remote

Quick summary

Work type: Remote
Location: Remote
Salary: $195,000–$290,000 / yr
Posted: 5 days ago

Market check

Salary context

Competitive pay

How this pay compares to similar roles

Similar $229k

This role $242k

$184k most similar roles pay here $301k

This role pays more than 60% of similar roles. Most pay $209,562–$247,975 — the shaded band above. At the midpoint, this role pays about $242k versus about $229k for comparable roles.

Based on 240 similar postings.

Employer

About CrowdStrike

CrowdStrike is a leading American cybersecurity technology firm, specializing in cloud-native endpoint protection, threat intelligence, and incident response.

CrowdStrike currently has 27 open roles on FindRole.

Listed pay typically runs $125,000–$180,000 across 27 roles with salary data.

Most-posted roles

View all roles at CrowdStrike

At a glance

TL;DR · Director, AI Alignment and Interpretability (Remote)

Apply Now Log in to save

As a senior research scientist in CrowdStrike's security-domain AI team, you will lead the alignment and interpretability research for specialized AI systems, focusing on understanding how these models represent threat concepts and detect misuse. Your daily work involves developing methods to probe model internals, identifying latent representations of vulnerability knowledge, and translating findings into actionable training interventions and evaluation protocols. You must have a deep background in mechanistic interpretability and experience with large language models, along with hands-on expertise in techniques like probing classifiers and circuit analysis. The role requires familiarity with offensive security and adversarial ML to ensure responsible deployment of AI systems that understand and reason about offensive techniques.

Skills

Python TensorFlow PyTorch Keras Scikit-learn Jupyter Git GitHub MLOps CI/CD AWS Google Cloud Platform Azure Machine Learning Docker Kubernetes Prometheus Grafana PostgreSQL MongoDB Redis Linux NVIDIA CUDA CUDA C/C++

What you'll do

Own the alignment and interpretability research agenda for security-domain AI systems.
Develop methods to detect misuse signals in model internals by probing latent representations and analyzing activations.
Create evaluation frameworks with behavioral constraints and training interventions to ensure models operate within intended bounds.
Publish original research on interpretability for security-specialized models to advance the field.
Recruit, develop, and retain a lean team of research scientists while remaining an active technical contributor.

What we're looking for

MS or PhD in machine learning, computer science, or related field with research depth in interpretability.
8+ years of experience in ML research or engineering, including direct work on large language model alignment.
Hands-on expertise with mechanistic interpretability methods applied to real models.
Experience designing and running rigorous alignment evaluations for AI safety claims.
Track record of leading and growing researchers while remaining an active technical contributor.
Background in offensive security, vulnerability research, or adversarial ML to understand misuse potential.

Similar roles

Director, Applied AI

Pfizer

NY +1 11 days ago $176,600–$294,300

Python Java AI/ML Cloud Engineering Data Engineering CI/CD DevOps Kubernetes Terraform AWS Azure PostgreSQL MongoDB Observability Monitoring LLMs Retrieval Architectures Generative AI Claude Code Claude Enterprise MCP Development Claude Skill Development

Hybrid

Save

Director, AI Enablement

Micron Technology

Boise, ID 56 days ago

CI/CD Azure OpenAI Microsoft 365 Copilot Vector databases MLOps Security automation tools Governance as Code Kubernetes Docker Python PostgreSQL AWS Grafana Prometheus

Save