fal is seeking a Staff Software Engineer to join their Compute team in San Francisco. This role focuses on building and maintaining large-scale computation platforms for AI workloads. The position requires expertise in backend systems that handle workload orchestration, request routing, and resource management. The ideal candidate will have deep knowledge of cloud infrastructure and Linux systems.
The role involves working with cutting-edge technologies including Kubernetes, Python, and various infrastructure tools to manage GPU computing resources. You'll be responsible for developing the core platform that handles AI workload orchestration, GPU server capacity management, and maintaining the infrastructure layer using tools like Terraform and Ansible.
This is an excellent opportunity for an experienced engineer who wants to work on challenging problems in the AI infrastructure space. The company offers competitive compensation ($180K-$250K plus equity) and comprehensive benefits including health insurance and visa sponsorship. While the position is primarily in-person in San Francisco, remote work may be considered for exceptional candidates.
The ideal candidate will be a self-starter with strong communication skills and deep experience in distributed systems and infrastructure management. You'll have the opportunity to shape the future of fal's infrastructure and work on interesting technical challenges while having significant impact on the company's growth and success.