The HPC/AI (High Performance Computing and Artificial Intelligence) team at Microsoft is building the next-generation distributed AI supercomputer. This senior software engineering role focuses on developing cutting-edge networking infrastructure for large-scale AI training, ensuring high performance, low latency, and minimal jitter for distributed AI workloads.
As a Senior Software Engineer on the HPC/AI team, you'll work at the intersection of AI and high-performance computing, designing and implementing networking solutions that power state-of-the-art AI systems. You'll work with diverse network architectures and cutting-edge processor technologies, focusing on performance, scalability, and observability.
The role requires deep expertise in networking protocols, distributed systems, and high-performance computing. You'll be responsible for architecting and optimizing communication frameworks, debugging complex networking issues, and ensuring the reliability of large-scale systems. Experience with AI-specific hardware, telemetry tools, and Linux systems is highly valued.
This is an opportunity to shape the future of AI infrastructure at Microsoft, working on systems that enable breakthroughs in artificial intelligence. The position offers competitive compensation ($117,200 - $229,200), comprehensive benefits, and the chance to work on cutting-edge technology that powers the next generation of AI innovation.
The role combines technical depth in networking and distributed systems with the scale and impact of Microsoft's AI initiatives. You'll be part of a team driving innovation in AI infrastructure, with opportunities to work on challenging technical problems and contribute to the architecture of systems that will define the future of AI computing.