Google is seeking a Research Engineer specializing in Vision Language Models to join its team. This role sits at the intersection of cutting-edge research and practical application, focusing on developing novel core technologies in computer vision and machine learning. The team works on customized visual language models, scene understanding, 3D computer vision, and text understanding, serving multiple perception-related products at Google with a particular focus on AR applications.
The ideal candidate will have strong foundations in machine learning and computer vision, with experience in productionizing ML solutions. You'll work with a team of Research Scientists and Software Engineers on 3D scene reconstruction, object and text understanding, and Generative AI. This position offers the opportunity to impact billions of users through Google's products while pushing the boundaries of what's possible in computer vision and AI.
The role involves developing solutions for augmented reality applications, integrating advanced visual language models into consumer products, and leading the implementation of Generative AI models. You'll collaborate closely with research teams to optimize model performance and enhance user experience, and you'll maintain strong relationships with stakeholders to keep projects aligned.
Working at Google means joining a company committed to technological innovation and equal opportunity. You'll be part of a diverse team that values fresh perspectives and creative problem-solving. The position offers the chance to work on projects that span from pure research to product development, making it an ideal opportunity for someone passionate about bridging the gap between cutting-edge research and real-world applications.