Google is seeking a Staff Software Engineer to join its ML, Systems, & Cloud AI (MSCA) organization, focusing on ML Performance and GPUs. This role is critical in optimizing Large Language Models (LLMs), including those behind Google Gemini, Search, and Cloud LLM applications. The position requires deep expertise in machine learning infrastructure, GPU programming, and performance analysis.
The role involves working with cutting-edge ML technologies and infrastructure that power Google's most important services. You'll be responsible for analyzing and optimizing LLM performance, maintaining benchmarks, and working closely with product teams to solve complex ML model performance challenges. The position offers the opportunity to impact billions of users through Google's services and Cloud platforms.
The ideal candidate will bring extensive experience in software development, ML infrastructure optimization, and GPU programming. You'll work in a complex, matrixed organization, collaborating with various teams to improve ML model efficiency and performance at scale. The role offers competitive compensation including base salary, bonus, equity, and comprehensive benefits.
This is an excellent opportunity for someone passionate about machine learning, performance optimization, and large-scale systems who wants to work at the forefront of AI technology. You'll help shape the future of hyperscale computing while working on some of the most advanced ML systems in the industry.