Browser agents are broken—and whoever fixes them will shape the next decade of software.
Today, even the best browser agents from labs like OpenAI, Anthropic, and Google fail over 80% of real-world tasks, often taking three times as long as humans to complete simple actions. Foundry is addressing this by building the first robust simulator, RL training environment, and evaluation platform designed specifically for browser agents. Historically, simulation environments and standardized benchmarks were critical in advancing self-driving cars (e.g., Waymo Sim, KITTI) and LLMs (e.g., HELM, MMLU). We're applying this proven method to browser automation, enabling accurate benchmarking, rapid iteration, and real-world reliability.
As a Founding Fullstack Engineer, you'll build critical systems and user experiences powering Foundry's web simulation and evaluation platform. You'll collaborate closely with ML and RL specialists, influencing key technical decisions and directly shaping our product's future.
You'll be responsible for:
We're a technically rigorous team of ML practitioners from Scale AI, committed to impactful engineering and groundbreaking products. By solving reliability challenges in browser automation, we're positioning ourselves at the center of a transformative shift in how software interacts with web interfaces.
Join us to: