Principal Data Engineer, LLM/AI Platforms (Remote)
At a glance
TL;DR (AI generated)
CrowdStrike seeks a Principal Data Engineer to join its expanding Data Science Platform Engineering Team, focusing on designing and deploying advanced data infrastructure for AI-driven security products. This role involves architecting scalable data platforms and pipelines for Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG) systems at exabyte scale, integrating agentic workflows, and ensuring robust MLOps practices. The ideal candidate will have deep expertise in Python or JVM technologies, distributed data processing frameworks like Spark, cloud platforms such as AWS or GCP, and containerization tools including Docker and Kubernetes. Candidates should hold a master’s degree or PhD, have over 10 years of experience in data engineering, and demonstrate hands-on skills in LLM engineering, RAG development, and large-scale system design, alongside strong mentoring abilities and a commitment to delivering high-quality code rapidly.
What you'll do
- Design and optimize data platforms to support Large Language Models (LLMs) at exabyte scale.
- Implement agentic workflows and agent harnessing techniques for autonomous security features.
- Develop highly scalable, fault-tolerant, and cost-effective data solutions with a focus on rapid iteration.
- Write production-ready code emphasizing performance, maintainability, and rigorous testing practices.
- Mentor engineers through technical workshops and design reviews to enhance AI platform knowledge.
What we're looking for
- Over 10 years of progressive experience in Data Engineering/Platform Engineering at massive scale.
- Expert-level hands-on experience in Large Language Model (LLM) engineering, including fine-tuning and deployment.
- Proven track record of designing and delivering large-scale distributed systems with sharding, partitioning, and concurrency.
- Strong expertise in MLOps tools such as MLflow, SageMaker, and Vertex AI, and in cloud platforms such as AWS, GCP, or OCI.
- Demonstrated ability to write clean, elegant, performant, and well-tested code with a focus on rapid delivery.
- Experience in technical leadership and mentorship roles, including conducting workshops and leading design reviews.
- Deep understanding of agentic workflows and agent harnessing techniques for autonomous data-driven security features.
Employer
About CrowdStrike
CrowdStrike is a leading American cybersecurity technology firm, specializing in cloud-native endpoint protection, threat intelligence, and incident response.
CrowdStrike currently has 14 open roles on FindRole.
Listed pay typically runs $125,000–$180,000 across 14 roles with salary data.
Most-posted roles
- Data Engineer, Go to Market (Remote) (1)
- Data Engineer, Go To Market (Remote) (1)
- Detection Engineer (Remote) (1)
- Director, Go-to-Market Business Applications (Remote) (1)
- Machine Learning Detection Engineer (Remote, East/Central) (1)