Beam is an ultrafast AI inference platform. We built a serverless runtime that launches GPU-backed containers in less than 1 second and quickly scales out to thousands of GPUs. Developers use our platform to serve apps to millions of users around the globe. We're backed by Y Combinator, Tiger Global, and prominent developer-tool founders, including the founder of Snyk and former CTO of GitHub.
We're a small, highly technical team with backgrounds in distributed systems and robotics. We've raised $7M from YC, Tiger, Guy Podjarny (founder of Snyk), and Jason Warner (former CTO of GitHub).
In this role, you'll build full-stack AI apps with our platform. You'll build examples, demos, and sharable mini-apps that showcase the most interesting capabilities of AI — and you'll use our infrastructure to do it. You'll also optimize inference performance for a wide range of models running on our platform. You will minimize latency, maximize throughput, and experiment to make sure the apps running on our platform have industry-leading performance.
Our mission is to build the world's best compute platform for AI. Our first product is a serverless inference platform, used by companies like Coca-Cola, GeoSpy, and hundreds more. We've built our own container runtime, called beta9, which is designed to launch GPU-backed containers in under 1s.
AI has introduced a new generation of workloads, like GPU inference, sandboxes, and agents. These aren't ordinary applications that can run as Lambdas or as Dockerized apps on VMs: they're massive, stateless containers that need to spin up in <1s, often across multiple clouds and regions.
We're searching for intensely curious, passionate, and hard-working engineers to join our mission to rebuild the cloud for the age of AI. Your work will directly impact millions of users worldwide.