Hugging Face, the fastest growing AI platform with over 5 million users and 100k+ organizations, is seeking a Machine Learning Engineer focused on Fast Optimized Inference. This role combines cutting-edge AI development with practical industrial applications, working on specialized software similar to their text-generation-inference project. The position requires expertise in Python, Rust, and specialized Cuda kernels Frameworks, offering an opportunity to work with state-of-the-art ML technologies.
The role involves developing scalable software solutions, optimizing system performance, and managing production environments. You'll be part of a team that's democratizing good AI, working alongside talented professionals in a collaborative, remote-friendly environment. The company strongly values diversity, equity, and inclusivity, offering comprehensive benefits including health insurance, flexible working hours, and company equity.
Hugging Face provides an excellent environment for professional growth, with reimbursement for conferences and training, and the opportunity to contribute to major scientific advancements in AI. Their open-source libraries have garnered over 400k+ stars on Github, demonstrating their significant impact in the ML/AI community. This position offers the unique opportunity to shape the future of AI while working for a company that prioritizes both technological innovation and employee well-being.