Machine Learning Engineer, Fast Optimized Inference - US Remote

Hugging Face

AI platform building company with over 5 million users & 100k organizations, sharing 1M+ models, 300k datasets & apps

United States

Machine Learning

Senior Software Engineer

Remote

501 - 1,000 Employees

This job posting is no longer active. 😔

Job Description

Hugging Face, the fastest growing AI platform with over 5 million users and 100k+ organizations, is seeking a Machine Learning Engineer focused on Fast Optimized Inference. This role combines cutting-edge AI development with practical industrial applications, working on specialized software similar to their text-generation-inference project. The position requires expertise in Python, Rust, and specialized Cuda kernels Frameworks, offering an opportunity to work with state-of-the-art ML technologies.

The role involves developing scalable software solutions, optimizing system performance, and managing production environments. You'll be part of a team that's democratizing good AI, working alongside talented professionals in a collaborative, remote-friendly environment. The company strongly values diversity, equity, and inclusivity, offering comprehensive benefits including health insurance, flexible working hours, and company equity.

Hugging Face provides an excellent environment for professional growth, with reimbursement for conferences and training, and the opportunity to contribute to major scientific advancements in AI. Their open-source libraries have garnered over 400k+ stars on Github, demonstrating their significant impact in the ML/AI community. This position offers the unique opportunity to shape the future of AI while working for a company that prioritizes both technological innovation and employee well-being.

Last updated 3 months ago

Responsibilities For Machine Learning Engineer, Fast Optimized Inference - US Remote

Develop specialized software for specific machine learning use cases with broad applications
Create scalable software solutions for industrial purposes using existing library frameworks
Enhance reliability, quality, and time-to-market of software suite
Measure and optimize system performance
Manage production environment by monitoring availability and system health

Requirements For Machine Learning Engineer, Fast Optimized Inference - US Remote

Python

Proficiency in Python
Experience with Rust
Knowledge of specialized Cuda kernels Frameworks
Experience with transformers, Keras or PyTorch

Benefits For Machine Learning Engineer, Fast Optimized Inference - US Remote

Medical Insurance

Dental Insurance

Vision Insurance

Education Budget

Equity

Parental Leave

Flexible working hours
Remote work options
Health, dental, and vision benefits
Flexible parental leave
Paid time off
Conference and training reimbursement
Education reimbursement
Company equity
Office spaces in NYC and Paris with visit opportunities
Workstation equipment provided