Microsoft Azure AI/HPC team is seeking a Senior Supercomputing Software Engineer to enable customers in deploying, monitoring, profiling, and debugging applications on hyperscale cloud infrastructure. This role is critical in maintaining Azure's position in Top500, MLPerf, and Graph500 rankings.
As a Senior Supercomputing Engineer, you'll be responsible for building and using state-of-the-art tools to ensure smooth operation of cloud-native supercomputers. You'll work with specialized tools and techniques to maintain reliability, runtime performance, and system health while meeting customer SLAs. The position involves establishing best practices, driving architectural changes, and influencing the roadmap of software and hardware components.
The role offers the opportunity to directly impact a wide range of users and facilitate the next wave of growth in AI and HPC in the cloud. Working at Microsoft, you'll join a culture focused on growth mindset, innovation, and collaboration, with comprehensive benefits including industry-leading healthcare, educational resources, and generous time off.
Key responsibilities include building tools for system metrics analysis, partnering with customers and vendors, ensuring platform consistency and scalability, and contributing to a test-driven engineering culture. The ideal candidate will have experience with AI/HPC systems, cloud infrastructure, and strong programming skills in languages like Python, Java, or JavaScript.
This position offers competitive compensation, with base pay ranging from $117,200 to $229,200 USD annually (higher in SF Bay Area and NYC). The role allows up to 100% work from home with 0-25% travel requirements, making it an excellent opportunity for those seeking to work at the forefront of cloud computing and AI technology while maintaining work-life balance.