Genmo, a research lab dedicated to building open, state-of-the-art models for video generation, is seeking an experienced Staff AI Infra Engineer to design, build, and scale their petabyte-scale data infrastructure. This role is crucial in shaping the future of AI and pushing the boundaries of what's possible in video generation.
Key responsibilities include:
- Designing highly scalable data infrastructure and systems to process petabyte-scale data stores
- Managing large-scale distributed processing jobs for ingesting and analyzing large-scale data sets for AI training
- Optimizing storage systems to maximize performance
- Building monitoring systems to ensure reliability of data infrastructure
The ideal candidate will have:
- A Bachelor's, Master's, or PhD in Computer Science or a related field
- 5+ years of experience working with large-scale systems
- Extremely strong experience with Python
- Strong experience with large-scale distributed computing frameworks (e.g., Spark, Ray)
- Experience with a systems-level language (Rust, Go, C++)
- Familiarity with a lakehouse format such as Delta Lake
- Past experience working on machine learning (very strong plus)
- Strong past experience working with Kubernetes and cloud environments (AWS, GCP, Azure)
- Past experience designing and then scaling a large-scale system from zero to one
Genmo's team is extremely technical with leaders in distributed systems, GPU programming, and large-scale training. The ideal candidate will excel at working on highly technical problems, have excellent problem-solving skills, be detail-oriented, and communicate clearly.
Join Genmo in their mission to unlock the right brain of AGI through advanced video generation models. This is an opportunity to work on cutting-edge AI technology and contribute to the future of artificial intelligence.