Research Scientist, Multilingual Large Language Models

Google is a leading global technology company specializing in internet-related services and products.
Machine Learning
Mid-Level Software Engineer
In-Person
5,000+ Employees
2+ years of experience
AI

Description For Research Scientist, Multilingual Large Language Models

Google is seeking a Research Scientist specializing in Multilingual Large Language Models to join their Technology & Society organization. This role combines cutting-edge research in LLMs with practical applications, focusing on advancing multilingual capabilities. The position involves developing advanced methodologies for multilingual environments, including pre-training models, enhancing instruction-tuning datasets, and optimizing multilingual tokenization. As a Research Scientist, you'll work on real-world problems spanning computer science, particularly in machine learning and natural language processing. The role offers the opportunity to contribute to the wider research community through publications and collaborations with universities worldwide. Google maintains a portfolio of research projects driven by fundamental research and product innovation, while providing the freedom to emphasize specific types of work. The Technology & Society organization aims to shape and advance technology innovations responsibly, considering their impact on users and society. This position offers the chance to work with world-class researchers, publish influential papers, and directly impact Google's multilingual AI technologies.

Last updated 4 days ago

Responsibilities For Research Scientist, Multilingual Large Language Models

  • Author research papers to share and generate impact of research results across the team and in the research community
  • Research and develop technology for improving multilingual Large Language Models (LLM) such as instruction-tuning, pre-training, multilingual reasoning
  • Research and develop technology for pre-training LLMs for target languages other than English
  • Collaborate with other research teams to expand multilingual LLM technology
  • Collaborate with Google first-party partner teams to deliver new multilingual technologies to production

Requirements For Research Scientist, Multilingual Large Language Models

Python
JavaScript
Java
  • PhD in Computer Science, a related field, or equivalent practical experience
  • Coding experience in Python, JavaScript, R, Java, or C++
  • One or more scientific publication submission(s) for conferences, journals, or public repositories
  • 2 years of coding experience in Python, JavaScript, R, Java, or C++ (preferred)
  • 1 year of experience owning and initiating research agendas (preferred)
  • Experience with modern Large Language Models (LLM) and generative models (preferred)
  • Experience with multilingual LLMs (preferred)
  • Recent publication track in related Generative Artificial Intelligence fields (preferred)

Interested in this job?

Jobs Related To Google Research Scientist, Multilingual Large Language Models

Software Engineer III, Machine Learning, Google Research

Software Engineer III position at Google Research focusing on machine learning and AI development, combining research innovation with practical product implementation.

Software Engineer III, AI/ML GenAI, Google Cloud

Software Engineer III position at Google Cloud focusing on AI/ML and GenAI development, offering competitive salary and benefits.

Software Engineer III, AI/ML, Google Cloud AI

Software Engineer III position at Google Cloud AI, focusing on machine learning infrastructure and implementation with competitive compensation and benefits.

Software Developer III, Machine Learning, Google Research

Join Google Research as a Software Developer III in Machine Learning, developing intelligent systems and next-generation technologies that impact billions of users worldwide.

Software Engineer, ML/AI Reference Models, Google Cloud

ML/AI Software Engineer role at Google Cloud, focusing on developing and integrating ML IP models with Cloud TPU SoC systems.