Google Cloud is seeking a Principal Engineer to lead the Vertex Generative AI Serving platform, a crucial component in delivering cutting-edge AI experiences across Google Cloud's global user base. This role combines technical leadership with strategic vision, requiring 15 years of software engineering leadership experience and deep expertise in generative AI technologies.
The position involves leading a multi-site engineering organization responsible for the performance, reliability, and efficiency of generative AI models in production. You'll collaborate with Vertex platform teams, Google Research, and CoreML teams to build and enhance the LM serving platform. The role demands both technical excellence and strategic leadership, including setting technical direction, defining operational standards, and driving platform adoption.
As a Principal Engineer, you'll shape the future of Google Cloud's AI infrastructure, working with technologies such as LLMs and diffusion models. The position offers the opportunity to influence AI development on a global scale in collaboration with teams across Google's ecosystem. You'll be instrumental in determining how AI technologies are deployed and used across Google Cloud's extensive customer base.
The ideal candidate combines deep technical expertise with strong leadership abilities, capable of inspiring teams and driving innovation in AI technology. You'll need to balance technical decisions with business strategy, ensuring the platform meets both current needs and future scalability requirements. This role offers the chance to work at the forefront of AI technology while leading and mentoring teams across multiple locations.
Working at Google Cloud means joining a team that's transforming how businesses operate through technology. You'll be part of an organization that values diversity, innovation, and technical excellence, with the opportunity to shape the future of AI infrastructure on a global scale.