Taro Logo

Machine Learning Engineer, Fast Optimized Inference - US Remote

AI platform building company with over 5 million users & 100k organizations, sharing 1M+ models, 300k datasets & apps
United States
Machine Learning
Senior Software Engineer
Remote
501 - 1,000 Employees
AI

Description For Machine Learning Engineer, Fast Optimized Inference - US Remote

Hugging Face, the fastest growing AI platform with over 5 million users and 100k+ organizations, is seeking a Machine Learning Engineer focused on Fast Optimized Inference. This role combines cutting-edge AI development with practical industrial applications, working on specialized software similar to their text-generation-inference project. The position requires expertise in Python, Rust, and specialized Cuda kernels Frameworks, offering an opportunity to work with state-of-the-art ML technologies.

The role involves developing scalable software solutions, optimizing system performance, and managing production environments. You'll be part of a team that's democratizing good AI, working alongside talented professionals in a collaborative, remote-friendly environment. The company strongly values diversity, equity, and inclusivity, offering comprehensive benefits including health insurance, flexible working hours, and company equity.

Hugging Face provides an excellent environment for professional growth, with reimbursement for conferences and training, and the opportunity to contribute to major scientific advancements in AI. Their open-source libraries have garnered over 400k+ stars on Github, demonstrating their significant impact in the ML/AI community. This position offers the unique opportunity to shape the future of AI while working for a company that prioritizes both technological innovation and employee well-being.

Last updated 8 days ago

Responsibilities For Machine Learning Engineer, Fast Optimized Inference - US Remote

  • Develop specialized software for specific machine learning use cases with broad applications
  • Create scalable software solutions for industrial purposes using existing library frameworks
  • Enhance reliability, quality, and time-to-market of software suite
  • Measure and optimize system performance
  • Manage production environment by monitoring availability and system health

Requirements For Machine Learning Engineer, Fast Optimized Inference - US Remote

Python
  • Proficiency in Python
  • Experience with Rust
  • Knowledge of specialized Cuda kernels Frameworks
  • Experience with transformers, Keras or PyTorch

Benefits For Machine Learning Engineer, Fast Optimized Inference - US Remote

Medical Insurance
Dental Insurance
Vision Insurance
Education Budget
Equity
Parental Leave
  • Flexible working hours
  • Remote work options
  • Health, dental, and vision benefits
  • Flexible parental leave
  • Paid time off
  • Conference and training reimbursement
  • Education reimbursement
  • Company equity
  • Office spaces in NYC and Paris with visit opportunities
  • Workstation equipment provided

Interested in this job?

Jobs Related To Hugging Face Machine Learning Engineer, Fast Optimized Inference - US Remote