Google Cloud is seeking a Staff Software Engineer to join their Cloud TPU team, focusing on accelerating ML workloads and enabling efficient training and deployment of advanced models. This role is crucial in developing the infrastructure and tools needed for Tensor Processing Unit (TPU) deployment on Google Cloud Platform (GCP).
The position involves working with cutting-edge ML infrastructure, supporting frameworks like TensorFlow, PyTorch, and JAX. As part of Google's Core ML organization, you'll play a vital role in building a unified, cross-Google ML infrastructure supporting both internal and external use cases.
The role combines technical leadership with hands-on development, requiring expertise in ML infrastructure, system design, and software architecture. You'll be responsible for designing and implementing solutions that scale across multiple generations of TPU hardware, ensuring seamless integration and optimal performance for cloud users.
Key responsibilities include architecting ML infrastructure solutions, leading technical design decisions, and developing tools for qualification and automation. The position offers competitive compensation ($197,000-$291,000 base salary) plus bonus, equity, and comprehensive benefits.
This is an excellent opportunity for experienced engineers passionate about ML infrastructure and cloud computing to make a significant impact at scale. You'll work with advanced technology while contributing to Google Cloud's mission of accelerating digital transformation across industries.
The role requires a strong background in software development, ML infrastructure, and technical leadership, with opportunities to work on challenging problems at the intersection of cloud computing and machine learning. Join Google Cloud to help shape the future of cloud-based ML acceleration and infrastructure.