Mistral AI is seeking a Lead Site Reliability Engineer (SRE) to spearhead their infrastructure team in building reliable, fault-tolerant, and scalable systems. This role combines leadership responsibilities with hands-on technical work, split equally between team leadership (33%), operations (33%), and development (33%). The position involves managing high-performing teams while ensuring the reliability of critical distributed environments and improving customer interactions with core products.
The role requires extensive experience in DevOps/SRE (10+ years) and leadership capabilities. You'll be responsible for designing and maintaining scalable infrastructure, implementing monitoring systems, and driving automation improvements. The position involves working with cutting-edge AI/ML technologies and contributing to open-source projects.
Mistral AI offers a flexible work environment with offices across Europe (Paris, London, Barcelona/Madrid, Berlin/Munich/Frankfurt). The company provides competitive compensation, including equity, and comprehensive benefits. They maintain a strong culture focused on rigorous reasoning, audacious thinking, and customer success.
The ideal candidate will bring expertise in cloud computing, distributed systems, and modern DevOps tools, combined with strong leadership and communication skills. Experience with AI/ML environments and high-performance computing would be particularly valuable. This is an opportunity to shape the future of AI infrastructure at a pioneering company with a global presence.