Google DeepMind is at the forefront of artificial intelligence research and development, working to advance AI for widespread public benefit and scientific discovery. We're seeking a Lead ML Engineer to spearhead the development of cutting-edge multimodal AI models for Gemini, our advanced AI system.
In this role, you'll be instrumental in shaping how users interact with image, video, and audio content across Google's platforms. You'll work with state-of-the-art research and transform it into scalable, production-ready solutions. The position involves developing advanced multimodal Gemini models, expanding capabilities in image and video understanding, and launching these innovations across web, mobile, and AR/VR platforms.
You'll collaborate with world-class researchers and engineers, applying techniques such as fine-tuning, RL*F, and policy optimization to push the boundaries of multimodal AI. Your work will directly impact Google's broader AI goals and the future of human-AI interaction.
The ideal candidate brings deep expertise in machine learning, particularly in multimodal AI, with a strong academic background (Master's or PhD) and proven experience in deploying large-scale ML models. You'll need to demonstrate exceptional problem-solving abilities and technical leadership skills to drive innovation in this rapidly evolving field.
Join us in building the next generation of AI technology and transforming how people interact with digital content.