Microsoft's Strategic Planning and Architecture (SPARC) team within Azure Hardware Systems and Infrastructure (AHSI) is seeking a Senior ML Research Engineer specializing in LLM Quantization & Model Optimization. This role combines cutting-edge research in machine learning with practical implementation in Microsoft's cloud infrastructure.
The position offers a unique opportunity to work at the intersection of large language models and hardware optimization, developing novel quantization techniques and optimization strategies for LLM deployment. You'll be part of the team that powers Microsoft's expanding cloud infrastructure, supporting services like Azure, Bing, Office 365, and Xbox Live.
As a Senior ML Research Engineer, you'll lead efforts in designing and implementing state-of-the-art model optimization techniques, working closely with cross-functional teams to improve the efficiency and performance of large language models. The role requires deep expertise in model quantization, optimization, and a strong understanding of Transformer architectures.
The position offers competitive compensation ($119,800 - $234,700 USD), comprehensive benefits, and the opportunity to work in a hybrid environment with up to 50% work from home flexibility. You'll be part of Microsoft's mission to empower every person and organization on the planet to achieve more, working with cutting-edge technology and collaborating with leading researchers and engineers.
This role is perfect for someone who combines strong theoretical knowledge with practical engineering skills, has a track record of research publications, and wants to impact the future of AI infrastructure at scale. You'll have the opportunity to influence the direction of LLM optimization at Microsoft while working with some of the most advanced AI systems in the industry.
The position requires a doctorate or equivalent experience, with at least 4 years of combined experience including 2+ years in industry focusing on low-precision model optimization. You'll be working in a collaborative environment that values innovation, technical excellence, and cross-team collaboration.