Mistral AI is at the forefront of democratizing artificial intelligence through its high-performance, open-source models and solutions. As a Research Engineer in Machine Learning, you'll be integral to building and optimizing the large-scale learning systems that power its open-weight models. The role offers two distinct paths: joining the Platform RE Team to enhance shared training frameworks and data pipelines, or embedding within a research squad focused on an area such as Alignment, Pre-training, or Multimodal development.
The position requires expertise in large-scale machine learning systems, with opportunities to work on cutting-edge deep-learning techniques, including sparsified 70B+ runs and distributed training across thousands of GPUs. You'll collaborate closely with Research Scientists, turning innovative ideas into scalable, production-ready code. The role combines research with practical implementation and requires strong Python skills and experience with modern ML frameworks.
Mistral AI offers a dynamic, collaborative environment with teams distributed across France, the USA, the UK, Germany, and Singapore. The company culture emphasizes creativity, low-ego attitudes, and team spirit. The company provides comprehensive benefits, including competitive compensation, health coverage, and various lifestyle perks. This is an excellent opportunity for experienced ML engineers who want to impact the future of AI while working with state-of-the-art technology and a global team.
The position offers flexibility: hybrid working arrangements in Paris or London, or remote work within the EU/UK with monthly hub visits. It suits those who want to bridge the gap between cutting-edge AI research and production-ready systems while contributing to a company that is actively shaping AI technology.