Principal Group Software Engineering Manager | Microsoft Careers
At a glance
AI generatedTL;DR
As a Principal Group Software Engineering Manager for Microsoft’s M365 Copilot inference team, you will lead the charge in transforming manual capacity management into an automated platform. This strategic role involves overseeing GPU fleet health and capacity planning to ensure efficient model deployment while meeting service level agreements. Your day-to-day responsibilities include building and leading high-performing teams of engineering managers and senior engineers, setting the roadmap for Copilot capacity management, and driving execution across existing teams with a plan for future growth. You will partner closely with various Microsoft divisions to align demand and supply, own live-site reliability, and establish key metrics for operational excellence. The ideal candidate has extensive experience in managing distributed systems at scale, building large-scale platforms from concept to production, and translating business needs into engineering strategies. Proficiency in Azure, M365, AI workloads, and automation technologies is essential, as well as a strong background in capacity planning and fleet management at hyperscale.
Skills
What you'll do
- Own end-to-end GPU fleet health and capacity platform to ensure reliability and observability.
- Design and scale automated model deployment processes to meet SLAs for priority workloads.
- Build a unified control plane connecting intake, planning, deployment, and fleet operations.
- Set strategy and roadmap for Copilot capacity management and the control plane.
- Coach and grow managers and senior ICs while raising the engineering bar in the organization.
What we're looking for
- Extensive experience managing distributed-systems or platform engineering teams at scale.
- Proven track record in designing, staffing, executing, and owning large-scale distributed systems.
- Strong ability to translate business needs into clear engineering strategies and execution plans.
- Demonstrated success in hiring, coaching, and developing people across multiple levels.
- Deep understanding of capacity planning, fleet management, and supply/demand optimization at hyperscale.
- Experience with Azure, M365, AI workloads, and cost models for inference and training systems.
- Background in building automation, control planes, or orchestration platforms from concept to production.
Employer
About Microsoft
Microsoft Corporation is a global technology leader producing software, hardware, and cloud services including Windows, Office 365, Azure cloud platform, Xbox gaming, and Surface devices. Industry: Software & Cloud Computing
Microsoft currently has 534 open roles on FindRole.
Listed pay typically runs $119,800–$234,700 across 488 roles with salary data.
Most-posted roles
- | Microsoft Careers 121
- Principal Software Engineer | Microsoft Careers 19
- Senior Software Engineer | Microsoft Careers 18
- Software Engineer II | Microsoft Careers 10
- Principal Applied Scientist | Microsoft Careers 5