Taro Logo

Machine Learning Engineer

Beam is a tool to build machine learning-powered applications, helping developers run code on serverless GPUs and deploy performant APIs without managing infrastructure.
$120,000 - $200,000
Machine Learning
Mid-Level Software Engineer
Hybrid
1 - 10 Employees
1+ year of experience
AI · Enterprise SaaS

Description For Machine Learning Engineer

Beam is an ultrafast AI inference platform that has built a groundbreaking serverless runtime capable of launching GPU-backed containers in under 1 second and scaling to thousands of GPUs. The platform serves millions of users globally and is backed by prestigious investors including Y Combinator and Tiger Global, along with notable developer-tool founders.

As a Machine Learning Engineer at Beam, you'll be at the forefront of optimizing inference performance across diverse models on their platform. Your role will focus on minimizing latency, maximizing throughput, and conducting experiments to achieve industry-leading performance. Your work will have direct impact on millions of users worldwide.

The ideal candidate should have strong experience with modern inference stacks like PyTorch, TensorRT, and vLLM, plus familiarity with AI workflows including ComfyUI and LoRA adaptors. Deep understanding of model compilation, quantization, and serving architectures is essential. You should be comfortable with GPU architectures and kernel-level optimizations, along with experience in CUDA, Triton, or similar frameworks.

The position offers competitive compensation ($120K-$200K with 0.20%-0.75% equity) and comprehensive benefits including health coverage, learning opportunities, and fitness stipends. While the team works in-person in New York City, they welcome exceptional remote candidates. This is an opportunity to join a fast-growing pre-Series A company that's building the future of ML infrastructure.

Last updated 24 days ago

Responsibilities For Machine Learning Engineer

  • Optimize inference performance for various models
  • Minimize latency and maximize throughput
  • Conduct experiments to achieve industry-leading performance
  • Work with GPU-backed containers and scaling systems

Requirements For Machine Learning Engineer

Python
Kubernetes
  • Experience with state-of-the-art inference stack (PyTorch, TensorRT, vLLM)
  • Familiar with modern AI workflows like ComfyUI and LoRA adaptors
  • Deep understanding of model compilation, quantization, and serving architectures
  • Familiarity with GPU architectures and kernel-level optimizations
  • Experience programming with CUDA, Triton, or similar low-level accelerator frameworks

Benefits For Machine Learning Engineer

Medical Insurance
Dental Insurance
Vision Insurance
Equity
  • Work on challenging and impactful engineering problems
  • Competitive salary and meaningful equity
  • Join a fast-growing pre-Series A company at the ground floor
  • Health, dental, and vision benefits with 90% coverage for employees and 50% for dependents
  • Opportunities to participate in events across the cloud-native and AI communities
  • Fitness stipend, learning budget

Interested in this job?

Jobs Related To Beam Machine Learning Engineer