Taro Logo

Lead ML Engineer - Multimodal AI (Gemini)

A team of scientists and engineers advancing artificial intelligence for widespread public benefit and scientific discovery.
$180,000 - $350,000
Machine Learning
Staff Software Engineer
In-Person
5,000+ Employees
8+ years of experience
AI
This job posting is no longer active. Check out these related jobs instead:

Job Description

Google DeepMind is at the forefront of artificial intelligence research and development, working to advance AI for widespread public benefit and scientific discovery. We're seeking a Lead ML Engineer to spearhead the development of cutting-edge multimodal AI models for Gemini, our advanced AI system.

In this role, you'll be instrumental in shaping how users interact with image, video, and audio content across Google's platforms. You'll work with state-of-the-art research and transform it into scalable, production-ready solutions. The position involves developing advanced multimodal Gemini models, expanding capabilities in image and video understanding, and launching these innovations across web, mobile, and AR/VR platforms.

You'll collaborate with world-class researchers and engineers, leveraging advanced techniques like fine-tuning, RL*F, and Policy Optimization to push the boundaries of multimodal AI. Your work will directly impact Google's broader AI goals and the future of human-AI interaction.

The ideal candidate brings deep expertise in machine learning, particularly in multimodal AI, with a strong academic background (Master's or PhD) and proven experience in deploying large-scale ML models. You'll need to demonstrate exceptional problem-solving abilities and technical leadership skills to drive innovation in this rapidly evolving field.

Join us in building the next generation of AI technology that will transform how people interact with digital content and shape the future of human-machine interaction.

Last updated 5 months ago

Responsibilities For Lead ML Engineer - Multimodal AI (Gemini)

  • Develop state-of-the-art multimodal Gemini models
  • Collaborate with research teams to evaluate and drive SOTA multimodal technologies
  • Provide technical leadership in defining strategic direction for multimodal model development
  • Lead implementation of advanced techniques including SFT, RL*F, IPO/DPO
  • Design and execute complex experiments to validate model architectures
  • Conduct data analysis to identify insights and trends
  • Develop data-driven recommendations for data flywheel improvement
  • Act as a technical mentor to team members

Requirements For Lead ML Engineer - Multimodal AI (Gemini)

Python
  • Master's degree or PhD in Computer Science, AI, Machine Learning, or related technical field
  • Experience in developing and deploying large-scale machine learning models
  • Experience with large language models and multimodal model architectures
  • Strong problem-solving and analytical skills
  • Strong data analysis skills
  • Experience with SFT, RL*F, IPO/DPO (preferred)
  • PhD in ML or considerable experience working with LLMs (preferred)

Benefits For Lead ML Engineer - Multimodal AI (Gemini)

Medical Insurance
Vision Insurance
Dental Insurance
  • Equal employment opportunities
  • Comprehensive health benefits