Google Cloud is seeking a Software Engineer III to join their AI/ML team, focusing on developing next-generation technologies for machine learning inference infrastructure. This role is part of the ML, Systems, & Cloud AI (MSCA) organization, which is responsible for the hardware, software, and systems infrastructure powering Google's services and Cloud platform.
The position involves working with cutting-edge LLM technology, particularly optimizing Gemini models for Google Distributed Cloud. You'll be responsible for implementing ML solutions, optimizing inference performance, and contributing to large-scale infrastructure development. The role requires expertise in ML infrastructure, distributed systems, and experience with LLMs.
This is an excellent opportunity for engineers passionate about AI/ML who want to impact billions of users through Google's services. You'll work with advanced technologies like TPUs and GPUs, contributing to products such as Vertex AI and handling critical infrastructure that powers services like Search, YouTube, and Google Cloud.
The position offers competitive compensation including base salary, bonus, equity, and comprehensive benefits. Google provides a collaborative environment where engineers can grow, innovate, and work on challenging problems at scale. The company is committed to diversity, equality, and creating a culture of belonging.
Working at Google Cloud means being at the forefront of AI technology, particularly in the rapidly evolving field of large language models. You'll have the opportunity to work with state-of-the-art tools and frameworks while contributing to the infrastructure that powers some of the world's most widely-used services.