OpenAI is seeking a Senior Software Engineer to join their Frontier Clusters Infrastructure team, which builds and manages some of the world's largest supercomputers for cutting-edge AI model training. This role combines distributed systems engineering with high-performance computing, focusing on designing and operating large-scale compute clusters that power advanced AI research. Based in San Francisco with a hybrid work model, the position offers a competitive salary range of $380K-$460K plus equity and comprehensive benefits.
The ideal candidate will work on developing software for orchestrating massive compute clusters, managing resource allocation, and automating cluster lifecycle operations. They should have strong expertise in distributed systems, experience with cloud platforms (particularly Azure), and proficiency in languages like Python and Go. The role requires someone who can interface effectively with researchers and engineering teams while ensuring the reliability and security of cluster systems.
OpenAI offers an exceptional benefits package including comprehensive health coverage, mental wellness support, generous parental leave, and professional development opportunities. The company's mission-driven approach focuses on ensuring AI benefits humanity, making this an opportunity to work on transformative technology while maintaining a strong emphasis on safety and ethical considerations.
The position combines technical challenges of hyperscale infrastructure with the excitement of working at the forefront of AI development. You'll be part of a team that directly enables OpenAI's most cutting-edge model training, making this role crucial for anyone interested in both distributed systems and artificial intelligence advancement.