Principal Program Manager
Redmond, WA 
Share
Posted 12 days ago
Job Description
OverviewWe are looking for a Principal Program Manager to join our Azure Hardware Diagnostics and Telemetry Services (HDTS) team to lead Graphics Processing Units(GPU) and Artificial Intelligence(AI) Accelerator hardware manageability solutions. In partnership with Azure High Performance computing(HPC) and HDTS, we are on a mission to deliver the hardware, software, services, and infrastructure roadmap that enables our users to run workloads in Azure - high-performance computing simulations, Artificial Intelligence(AI), Machine Learning(ML), inferencing, remote visualization, and immersive gaming experiences. Together, we are responsible for providing Azure customers with supercomputer-class capabilities to accelerate their research, drive differentiation, and finding answers to some of the most difficult questions of science and industry, and enabling Microsoft's AI initiatives like Copilot and ChatGPT with Open AI. You will help us define and develop tactical and strategic solutions to meet the immediate and long-term needs of our services that form a critical part of Azure infrastructure. You will be our liaison to internal teams and external partners including silicon suppliers, Original Design Manufacturers(ODMs), and systems integrators. You communicate Microsoft Azure requirements to our external partners, as well as coordinate collaborative efforts with various internal Azure Engineering teams. You have effective communication skills to drive programs with multiple internal/external partners teams located in different geographical regions worldwide. As a Principal Program Manager, you have the ability to understand customer needs and a willingness to develop holistic and data-driven approaches to product solutions that delight and exceed their expectations. You have a good understanding of hardware, firmware, diagnostics, telemetry, Hardware/Software quality, the ability to apply these fundamentals to cloud-readiness, and most importantly - an unyielding passion for creating products that customers will fall in love with. The candidate will have a track record translating the needs of a highly diverse set of AI & machine learning use cases into a differentiated set of supercomputing-class systems, solutions, and experiences, and apply them to Azure's underlying infrastructure systems. Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond. In alignment with our Microsoft values, we are committed to cultivating an inclusive work environment for all employees to positively impact our culture every day.
ResponsibilitiesAs a Principal Program Manager, you will work with the engineering team and partners to extend our Diagnostics portfolio to define, deliver, and sustain GPU and Accelerator diagnostics across the Azure fleet.Drive business critical objective and key result to improve Azure's Diagnostics and Telemetry metrics. A measurement that ensures Hardware Diagnostics signals have unambiguous root causes, and clear repair actions for data center operations teams to resolve hardware issues and get capacity back to production.Work with our Hardware New Program Initiative (NPI) teams on new GPU / AI accelerator programs to ensure our Diagnostics and Telemetry development, validation and are ahead of program Time To Market milestones.In addition to driving NPI development, this principal program manager is expected to drive strategy for long term fleet sustainability of Hardware Diagnostics workflows across organizational teams.Define and develop vendor agnostic requirements that powers Azure through active engagement with internal/external partners.Microsoft is leading the way in building standardized requirements and process with key partners in the hardware industry. You will partner with other technical program managers, Architects, and engineering managers to drive this standardization within Microsoft and across the silicon industry.Drive development teams to deliver features by agile execution across sprints, backlogs, milestones, and lead continuous planning efforts based on in-fleet diagnostics data for improvements to hardware diagnostics.Develop requirement specs and program manager functional specs for new featuresManage semester planning, engineering sprint and backlog with Engineering teamsWork with external partners (Silicon Vendors, Original Design Manufacturers(ODM), Original Equipment Manufacturers(OEM), etc.) to drive Microsoft Azure requirement and coordinate development efforts with different Azure and Microsoft Engineering teamsResponsible for bringing in systematic Software/Firmware development practice to the teamCommunicate with multiple internal/external partners teams located in different places in the world to drive programs.OtherEmbody our Culture and Values

 

Job Summary
Company
Start Date
As soon as possible
Employment Term and Type
Regular, Full Time
Required Experience
Open
Email this Job to Yourself or a Friend
Indicates required fields