Cloud ML Compute Services (CMCS) at Google is seeking a Principal Engineer to drive the technical strategy for ML Frameworks and Models. This role focuses on enabling massive-scale ML services powered by GPUs and TPUs. The successful candidate will lead the development of cloud-based ML solutions for training and serving large models (e.g., LLMs, MoE, diffusion, and ranking/recommendation models) using cutting-edge AI hardware such as Google TPUs and NVIDIA GPUs.
Key responsibilities include:
The ideal candidate will have extensive experience in software development, distributed systems, machine learning algorithms, and cloud infrastructure. They should be able to work cross-functionally and should have excellent problem-solving and communication skills.
This role offers the opportunity to shape the future of Google's ML compute services and drive innovation in the rapidly evolving field of AI and machine learning. The position comes with a competitive salary range, bonuses, equity, and benefits, reflecting the high-level expertise required for this principal engineering role.