Google Cloud AI is seeking a Senior Staff Software Engineer to join their New Product Introduction (NPI) and Platform Readiness team within the AI Hypercomputer Infrastructure organization. This role is critical in bringing new Tensor Processing Unit (TPU) and Graphics Processing Unit (GPU) generations to the Google Cloud platform and enabling end-to-end AI/ML compute experiences for Google Cloud users.
The position involves working with cutting-edge AI technologies and infrastructure, helping to qualify and bring up the complete Cloud TPU and Cloud GPU Hypercomputer stack, including VMs, Networking, Storage, Google Kubernetes Engine (GKE), and software tool chains. The role requires building and delivering telemetry infrastructure for both GPU/TPU fleet and critical AI/ML workloads, while optimizing stability and performance for high-priority AI/ML applications.
As a Senior Staff Software Engineer, you'll be responsible for developing and executing multi-year plans for validating end-to-end stack for TPU and GPU Products, ensuring that products deliver performance and stability to make AI/ML customers successful. The role requires extensive expertise in distributed systems and machine learning, along with strong leadership abilities to build strategic technical alignment with major organizations across Google.
The position offers competitive compensation ranging from $248,000 to $349,000 plus bonus, equity, and comprehensive benefits. You'll be working in a collaborative environment, developing strong relationships across organizational boundaries with cross-functional teams to achieve the delivery of a high-performing end-to-end stack.
This is an excellent opportunity for experienced engineers who want to make a significant impact on Google Cloud's AI infrastructure and help shape the future of machine learning technologies. The role combines technical leadership, strategic planning, and hands-on engineering work in one of the most advanced AI computing environments in the industry.