Microsoft is seeking a Senior Principal Machine Learning Engineer to join their team building a planet-scale multi-modal database that integrates Large Language Models (LLMs) and AI throughout the stack. This role combines cutting-edge machine learning engineering with practical production deployment.
The ideal candidate will lead and collaborate with engineering teams to design, build and deploy large-scale machine learning systems. Key focus areas include data pipelines, distributed training, model serving, and performance optimization. The role requires deep expertise in training large-scale modern ML models like transformers and foundation models, along with hands-on experience in optimization techniques such as mixed precision training, RLHF, and LoRA.
You'll work in a collaborative, friendly environment that values diverse perspectives and continuous learning. The team emphasizes problem-solving, design excellence, and quality software delivery. This is an opportunity to influence next-generation computing at Microsoft while growing your career.
The position offers competitive compensation ($163,000 - $296,400 base salary, higher in SF/NYC) and comprehensive benefits including healthcare, education resources, parental leave, and investment options. Microsoft maintains an inclusive culture focused on empowering others and innovating with a growth mindset.
Required qualifications include 8+ years of experience with a Bachelor's degree (or equivalent combination of education/experience) in Computer Science, ML/AI or related field. The role requires proven experience training large-scale ML models and implementing key optimization techniques. Knowledge of model evaluation, data curation, and research practices is essential.