Lead ML Engineer - Multimodal AI (Gemini)

Google DeepMind

A team of scientists and engineers advancing artificial intelligence for widespread public benefit and scientific discovery.

London, UK

$180,000 - $350,000

Machine Learning

Staff Software Engineer

In-Person

5,000+ Employees

8+ years of experience

This job posting is no longer active. Check out these related jobs instead:

Job Description

Google DeepMind is at the forefront of artificial intelligence research and development, working to advance AI for widespread public benefit and scientific discovery. We're seeking a Lead ML Engineer to spearhead the development of cutting-edge multimodal AI models for Gemini, our advanced AI system.

In this role, you'll be instrumental in shaping how users interact with image, video, and audio content across Google's platforms. You'll work with state-of-the-art research and transform it into scalable, production-ready solutions. The position involves developing advanced multimodal Gemini models, expanding capabilities in image and video understanding, and launching these innovations across web, mobile, and AR/VR platforms.

You'll collaborate with world-class researchers and engineers, leveraging advanced techniques like fine-tuning, RL*F, and Policy Optimization to push the boundaries of multimodal AI. Your work will directly impact Google's broader AI goals and the future of human-AI interaction.

The ideal candidate brings deep expertise in machine learning, particularly in multimodal AI, with a strong academic background (Master's or PhD) and proven experience in deploying large-scale ML models. You'll need to demonstrate exceptional problem-solving abilities and technical leadership skills to drive innovation in this rapidly evolving field.

Join us in building the next generation of AI technology that will transform how people interact with digital content and shape the future of human-machine interaction.

Last updated 5 months ago

Responsibilities For Lead ML Engineer - Multimodal AI (Gemini)

Develop state-of-the-art multimodal Gemini models
Collaborate with research teams to evaluate and drive SOTA multimodal technologies
Provide technical leadership in defining strategic direction for multimodal model development
Lead implementation of advanced techniques including SFT, RL*F, IPO/DPO
Design and execute complex experiments to validate model architectures
Conduct data analysis to identify insights and trends
Develop data-driven recommendations for data flywheel improvement
Act as a technical mentor to team members

Requirements For Lead ML Engineer - Multimodal AI (Gemini)

Python

Master's degree or PhD in Computer Science, AI, Machine Learning, or related technical field
Experience in developing and deploying large-scale machine learning models
Experience with large language models and multimodal model architectures
Strong problem-solving and analytical skills
Strong data analysis skills
Experience with SFT, RL*F, IPO/DPO (preferred)
PhD in ML or considerable experience working with LLMs (preferred)

Benefits For Lead ML Engineer - Multimodal AI (Gemini)

Medical Insurance

Vision Insurance

Dental Insurance

Equal employment opportunities
Comprehensive health benefits