Microsoft CoreAI is developing a new AI-first app stack to empower developers in shaping the future with Artificial Intelligence. The Foundry Models team is seeking a Principal Software Engineer to work on providing serverless access to various AI models from providers like DeepSeek, Mistral AI, Cohere, and Meta. This role offers the opportunity to tackle complex performance optimization challenges in inference runtimes and contribute to features serving enterprise customers.
As a Principal Software Engineer in the CoreAI team, you'll be at the forefront of large-scale AI inferencing, working with cutting-edge models and ensuring the platform can deliver AI capabilities to developers and enterprises alike. The role requires a highly technical, hands-on approach to solving complex problems and collaboration with multiple partner teams.
The position offers competitive compensation ranging from $137,600 to $267,000 per year (higher in SF Bay Area and NYC), along with comprehensive benefits including healthcare, educational resources, and parental leave. Microsoft provides a collaborative environment focused on growth mindset and innovation, where you'll have the opportunity to shape the future of AI technology while working with industry-leading professionals.
This role combines technical leadership, hands-on development, and strategic thinking, making it ideal for experienced engineers passionate about AI and scalable systems. You'll be instrumental in building and optimizing the infrastructure that powers next-generation AI applications while mentoring other engineers and contributing to Microsoft's mission of empowering every person and organization on the planet.