Google Cloud's XBorg team is seeking a Software Engineer III to join its Borg Control Plane division. XBorg is an orchestration layer that schedules throughput-oriented workloads across clusters, with a particular focus on machine learning training and inference. The role sits within the ML, Systems, & Cloud AI (MSCA) organization, which is responsible for the infrastructure powering Google's core services and Cloud offerings.
As a Software Engineer III, you'll develop and enhance XBorg features such as weighted fair queuing, opportunistic resource allocation, and platform flexibility. These improvements directly affect resource efficiency for ML workloads across major Alphabet products. The position offers the opportunity to work on large-scale distributed systems that serve billions of users worldwide.
The ideal candidate has strong experience in software development, data structures, and algorithms; knowledge of machine learning infrastructure is highly valued. You'll be part of a team that prioritizes security, efficiency, and reliability while pushing the boundaries of hyperscale computing. The role also offers the chance to work on technology such as Google Cloud's Vertex AI.
Working at Google means joining a company that's committed to innovation, technical excellence, and building technology that makes a global impact. You'll have the opportunity to collaborate with world-class engineers, work on challenging technical problems, and help shape the future of cloud computing and machine learning infrastructure.