Mistral AI, a pioneering company in artificial intelligence, is seeking an Applied Scientist/Research Engineer to join their innovative team. This role presents an exciting opportunity to work at the forefront of AI technology development, focusing on creating and implementing state-of-the-art models across various modalities including text, image, and speech.
The position is based in either Palo Alto or NYC, where you'll be part of a dynamic, collaborative team distributed across France, USA, UK, Germany, and Singapore. Mistral AI's mission is to democratize AI through high-performance, optimized, open-source solutions, including their AI assistant platform "le Chat."
As an Applied Scientist/Research Engineer, you'll be responsible for running pre-training and post-training operations on large GPU clusters, developing essential tools and frameworks, and managing complex research projects. The role requires expertise in PyTorch or JAX, strong Python programming skills, and the ability to work independently with large codebases.
The ideal candidate should hold a PhD or master's degree in a relevant field such as Mathematics, Physics, Machine Learning, or Computer Science, though exceptional candidates from different backgrounds are encouraged to apply. Experience with agents, multi-modality, robotics, diffusion, or time-series would be valuable.
The company offers an attractive benefits package including competitive compensation with equity, comprehensive healthcare coverage, 401K matching, generous PTO, and various allowances for meals and transportation. This is an excellent opportunity for someone passionate about AI who wants to make a meaningful impact while working with cutting-edge technology in a collaborative, low-ego environment.