Principal Software Engineer Manager - CTJ - Poly
$139,900 - $274,800/year
Job Description
AI-Native Development: Leads team on the disciplined use and improvement of artificial intelligence (AI) tools and practices across the software development lifecycle (SDLC). Guides team on proactively taking responsibility for the content of their AI-generated requirements, design documents, code, and other assets, and assisting other members of the team to do the same. Leads team on incorporating Responsible AI practices into the SDLC to ensure appropriate controls over AI-generated assets. Coaches team on applying SDLC and engineering health measures (e.g., Accelerate, SPACE framework, Engineering System Success Playbook [ESSP]) to guide improvements to processes and practices, especially those involving AI. Leads team on experimenting with AI tools and practices to improve their own capabilities, and providing recommendations to others on how to adopt them. Guides team within and across teams on producing extensible, maintainable, well-tested, secure, and performant code that adheres to design specifications. Guides team in creating a clear test strategy that ensures solution quality and prevents regressions from being introduced into existing code. Guides team on ensuring test plans incorporate security testing to validate security invariants (including negative cases). Guides teams on, and leads, identifying dependencies and incorporating them into the development of design documents for a product, application, service, or platform. Coordinates and collaborates with other teams to reach common goals where dependencies and validation concerns overlap. Enables communications and negotiates across teams to resolve conflicts among dependency ownership and required work. Guides others on owning and leading efforts and discussions for architecture of aspects of complex products/solutions (e.g., design, cost).
Guides others on leading the testing and exploration of various design options across a set of complex product/solution scenarios, ensuring the strengths and weaknesses of each option are outlined and making recommendations for which design option is best. Guides others on, and writes, design documents that support user stories and other product requirements. Evaluates new technologies to solve classes of problems, and determines how to integrate these technologies within existing systems. Guides employees to consider, and leads efforts to ensure, that system architecture and individual designs meet performance, scalability, resiliency, disaster recovery, cost of goods sold (COGS), and other requirements and expectations. Guides team on understanding the importance of building solutions that expand upon the work of others. Guides team on leading the identification of requirements for, and the comprehensive application of, automation within production and deployment across products, targeting zero-touch deployment when possible. Guides others to run code in simulated or other non-production environments to confirm functionality and error-free runtime across products. Ensures a continuous integration/continuous deployment (CI/CD) infrastructure is in place that promotes developer and operational agility (e.g., low lead-time-to-change metrics). Guides team on supporting and/or adopting, and potentially setting security standards for, clear security code review practices for a set of products that align with design and engineering principles to raise the security hardening for both protections and detections. Ensures that their team includes required security monitoring to ensure detection of violations.
Collaborates with relevant security partners to define security promises and security invariants for the design of a product/solution while factoring in attacker/investigator personas for security monitoring and telemetry needs, ensure threat models and premortems validate upstream and downstream assumptions and security invariants, establish security breach drills and security incident response processes (e.g., impact analysis, containment), and ensure that artificial intelligence (AI) safety features are implemented for the AI production systems tied to a set of products. Guides the decision-making process around tool development. Ensures the team identifies whether open-source or internal code is available to address coding needs for a set of products, and uses or reuses it in a responsible manner as applicable. Guides others on, and uses and enhances, or builds, new software developer tools to support easier, faster, and more effective software engineering for products. Develops skills in tools outside current areas of expertise. Guides team and leads efforts to ensure the correct processes are followed to achieve a high degree of security, privacy, safety, and accessibility across solutions and teams. Guides team to create and ensure the presence of visible evidence (e.g., audit trail) to demonstrate compliance for products. Guides team on driving multiple groups' project plans, release plans, and work items in coordination with appropriate stakeholders (e.g., technical program managers). Guides other members on project estimation. Guides team on driving efforts to ensure required security protections and detection processes are accounted for in planning. Guides team on driving efforts to ensure project plans adhere to security, privacy, and compliance requirements. Guides team on driving efforts to ensure all code for a set of products/solutions is properly flighted for quicker mitigation of production incidents.
Establishes standards for the correct measures to deploy products. Guides others on, and leads efforts to ensure, that solutions are deployed in a safe manner, rolling out security-sensitive features only to applicable, relevant customers and scenarios to reduce the attack surface. Guides others on, and leads efforts to, proactively monitor dependency status and ensure that only the latest, secure versions are deployed. Ensures deployment infrastructure is in place to allow developers' private builds for a set of products/solutions to be tested in a production-like environment. Ensures there are time and resources for engineers to conduct experiments. Managers deliver success through empowerment and accountability by modeling, coaching, and caring. Enables success across boundaries. Ensures ongoing support for services or products is robust and effective through effective telemetry and incident response processes and adherence to security best practices for the most critical or highest-impact spaces, such as those with deep technical domain connections or a broad set of products or services at critical junctures (e.g., early in development, urgent time horizon). Establishes guidelines and policies for creating telemetry and novel processes or tools for telemetry, engaging in live site maintenance, and responding to incidents. Provides technical oversight on telemetry in systems and products to provide feedback on system behavior such as performance, reliability, availability, and utility, and implements safety mechanisms, creating iterative feedback loops that inform subsequent monitoring designs.
Provides technical oversight and expertise in efforts to classify, analyze, and interpret data and analyses on a range of metrics (e.g., health of the system, where bugs might be occurring), and technical leadership for creating outputs (e.g., notifications, dashboards) that improve monitoring and investigating security-related concerns and scenarios, system monitoring and/or issue identification and mitigation. Ensures appropriate systems are enacted to reduce incident volume and severity and meet the strategic needs of the product or service, and drives a live-site-first mentality. Establishes relevant metrics to determine live site response capabilities and successful incident response. Guides and oversees the creation and integration of iterative feedback loops on telemetry data for future product generation. Contributes to defining their division's strategy for enabling secure operations, security monitoring, and integration with live site investigation activities. Establishes best practices for considering and addressing the privacy implications of telemetry code changes and adding new data points. Guides team and acts as an expert for designated responsible individual (DRI) and monitors other engineers across product lines, working on call to monitor system/product/service for degradation, downtime, or interruptions. Develops a playbook, guidelines, and processes for the team to resolve issues. Coordinates people and resources to ensure DRI responsibilities are covered across teams. Ensures responses are within service level agreement (SLA) timeframe. Ensures overall DRI effectiveness and health of their team. Ensures solutions are scalable, financially responsible, and meet capture/storage guidelines. Guides team and leads efforts to classify and analyze complex data and analyses on a range of metrics (e.g., health of the system, where bugs might be occurring). Ensures playbooks are logical and understandable.
Establishes standardized processes and guides others who implement solutions and mitigations to issues impacting performance or functionality of live site services. Enables and may drive the enablement of secure operations, security monitoring, and integration with live site investigation activities. Ensures comprehensive observability and monitoring are in place for all the services their team oversees, and advocates for the inclusion of automated incident response/open-source tooling, mitigation, analysis, and self-healing in those services. Guides partnership with appropriate internal (e.g., product manager, privacy/security subject matter expert, technical lead) and external (e.g., customer escalation team, public forums) stakeholders and leverages expertise to determine and confirm customer/user requirements and their feasibility within and across teams. Seeks and leverages a variety of feedback channels to incorporate customer insights into current and future designs or solution fixes. Guides team on incorporation of unwritten requirements, such as appropriate continuous feedback loops that measure actionable, quantitative (e.g., customer value, usage patterns, solution performance) and qualitative (e.g., accessibility, globalization) indicators of value. Determines additional critical metrics for success. Leads team in understanding and leading the provision of feedback on, and the advocacy of, the security and privacy needs of the customer who will be using the set of solutions.
Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
These requirements include, but are not limited to, the following specialized security screenings: The successful candidate must have an active U.S. Government Top Secret Clearance with access to Sensitive Compartmented Information (SCI) based on a Single Scope Background Investigation (SSBI) with Polygraph. Failure to maintain or obtain the appropriate U.S. Government clearance and/or customer screening requirements may result in employment action up to and including termination.
Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
Software Engineering M5 - The typical base pay range for this role across the U.S. is USD $139,900 - $274,800 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $188,000 - $304,200 per year. Certain roles may be eligible for benefits and other compensation.
About Microsoft
Microsoft Corporation is a global technology leader producing software, hardware, and cloud services including Windows, Office 365, Azure cloud platform, Xbox gaming, and Surface devices. Industry: Software & Cloud Computing