Google is seeking a Staff Software Engineer specializing in ML Performance and GPUs to join its ML, Systems, & Cloud AI (MSCA) organization. This role is crucial for optimizing Large Language Models (LLMs), including Google Gemini, Search, and Cloud LLM applications. The position involves deep technical work in GPU programming, performance analysis, and machine learning infrastructure optimization.
The role sits within Google's MSCA organization, which is responsible for designing and implementing the hardware, software, machine learning, and systems infrastructure that powers all Google services and Google Cloud. This team's work has global impact, from developing TPUs to running global networks and shaping the future of hyperscale computing.
As a Staff Software Engineer, you'll work on critical performance optimization for Google's most advanced AI models. You'll collaborate with product teams to solve complex ML model performance challenges, particularly in scaling LLM training across thousands of GPUs. The role requires deep expertise in both software engineering and machine learning systems, with a focus on performance analysis and optimization.
The position offers competitive compensation, including a base salary range of $197,000-$291,000 plus bonus, equity, and comprehensive benefits. This is an excellent opportunity for experienced engineers who want to work at the cutting edge of AI and machine learning infrastructure and make a direct impact on Google's most important AI initiatives.
The ideal candidate will bring strong technical depth in software development, machine learning systems, and GPU programming, combined with the leadership skills needed to drive technical direction and collaborate across Google's complex organization. This role offers the chance to shape the future of AI infrastructure at one of the world's leading technology companies.