Microsoft's AI Frameworks team, part of the CoreAI organization, is seeking a Senior Software Engineer to drive innovation in large-scale AI. The role focuses on enabling state-of-the-art large language model (LLM) training and inference through deep optimization across software and hardware stacks. This position is specifically part of a specialized sub-team building the end-to-end software stack for Microsoft's first-party AI accelerators.
The role involves working with PyTorch, ONNX, and other open AI frameworks, pushing the boundaries of performance, scalability, and efficiency on various hardware accelerators. You'll collaborate closely with hardware architects, compiler teams, and model experts to co-design software solutions that unlock the full potential of custom silicon.
This is a highly technical position that directly impacts Microsoft's long-term AI infrastructure strategy, powering next-generation models and services across Azure and Microsoft products. The ideal candidate should have strong expertise in C++ and/or Python, experience with LLM serving technologies, and a deep understanding of software engineering fundamentals.
The position offers competitive compensation, comprehensive benefits, and the opportunity to work on cutting-edge AI technology that operates at global scale. You'll be part of a team that's fundamental to Microsoft's AI initiatives, with the chance to shape the future of AI infrastructure and implementation.