Genmo, an innovative AI research lab, is seeking a Senior Platform Engineer to join their team in San Francisco. The company is focused on developing cutting-edge video generation models to advance artificial general intelligence (AGI). This role presents a unique opportunity to work at the intersection of infrastructure and AI, building and maintaining the systems that power next-generation video generation models.
As a Senior Platform Engineer, you'll be responsible for architecting and managing a sophisticated multi-cluster infrastructure that spans both cloud and on-premises GPU environments. Your work will be crucial in ensuring the reliable deployment and operation of AI models at scale, with a focus on zero-downtime deployments and optimal performance.
The ideal candidate brings strong expertise in distributed systems, with at least 5 years of experience building production-grade systems. You should be proficient in systems programming languages like Go or Rust, along with Python, and have deep knowledge of modern DevOps practices and tools including Kubernetes, infrastructure-as-code, and observability systems.
This role offers the opportunity to work on challenging technical problems at scale, including GPU capacity planning, global load balancing, and building robust observability systems for AI infrastructure. You'll also play a key leadership role, mentoring team members and influencing the technical direction of the platform.
Working at Genmo means being at the forefront of AI video generation technology, with the chance to shape the infrastructure that powers next-generation AI models. The company offers a collaborative environment where you can make significant contributions to the future of AI technology while working with state-of-the-art tools and systems.