Machine Learning Engineer, Fast Optimized Inference - US Remote

Building the fastest growing platform for AI builders with over 5 million users & 100k organizations sharing 1M+ models, 300k datasets & apps.
Machine Learning
Mid-Level Software Engineer
Remote
501 - 1,000 Employees
3+ years of experience
AI

Description For Machine Learning Engineer, Fast Optimized Inference - US Remote

Hugging Face, the fastest growing AI platform with over 5 million users and 100k organizations, is seeking a Machine Learning Engineer focused on Fast Optimized Inference. This role is perfect for passionate engineers interested in creating specialized ML libraries for real-world applications. You'll work on developing software similar to text-generation-inference, focusing on industrial-level usage and scalability. The position involves writing specialized code that builds on Hugging Face's open-source foundation, whose libraries have 400k+ GitHub stars.

The role combines hands-on development with performance optimization and production management. You'll be responsible for developing ML-specific software, ensuring system reliability, and monitoring production environments. The ideal candidate should be proficient in Python, have experience with Rust, know specialized CUDA kernels, and have worked with frameworks such as transformers, Keras, or PyTorch.

Hugging Face offers an inclusive, development-focused environment where you'll work with industry-leading professionals. The company provides comprehensive benefits including flexible remote work, health, dental, and vision coverage, parental leave, and equity participation. It strongly values diversity and community contribution, supporting the broader ML/AI ecosystem through collaborative scientific advancement.

This position offers a unique opportunity to impact AI democratization while working with cutting-edge technologies. You'll be part of a progressive, decentralized team developing solutions that enhance user experiences and push the boundaries of AI applications. The role combines technical expertise with real-world impact, making it ideal for engineers passionate about advancing AI technology while maintaining practical applications.

Last updated 4 days ago

Responsibilities For Machine Learning Engineer, Fast Optimized Inference - US Remote

  • Develop specialized software for specific machine learning use cases with broad applications
  • Create scalable software solutions for industrial purposes using existing library frameworks
  • Enhance reliability, quality, and time-to-market of the software suite
  • Measure and optimize system performance
  • Manage production environment by monitoring availability and system health

Requirements For Machine Learning Engineer, Fast Optimized Inference - US Remote

  • Proficiency in Python
  • Experience with Rust
  • Knowledge of specialized CUDA kernels
  • Experience with frameworks such as transformers, Keras, or PyTorch

Benefits For Machine Learning Engineer, Fast Optimized Inference - US Remote

  • Flexible working hours
  • Remote work options
  • Health, dental, and vision benefits
  • Flexible parental leave
  • Paid time off
  • Company equity
  • Conference and training reimbursement
  • Opportunity to visit company offices
  • Workstation equipment support
