Google is seeking a Software Specialist for their TPU (Tensor Processing Unit) team, focusing on the lower-half infrastructure components. This role is part of the ML, Systems, & Cloud AI (MSCA) organization, which is responsible for designing and implementing the hardware, software, machine learning, and systems infrastructure that powers Google's services and Google Cloud.
The position requires deep expertise in embedded systems, PCIe protocols, and low-level software development. You'll be working on critical infrastructure that supports Google's machine learning and AI initiatives, including the development and maintenance of TPU Firmware, systems software, and Pod Management/Control software.
As a Software Specialist, you'll be responsible for creating tools and dashboards for quality assurance, identifying and reporting bugs, and working closely with cross-functional teams to resolve deployment challenges. The role involves both technical development and operational support, requiring strong problem-solving skills and the ability to work under pressure during critical incidents.
The position offers competitive compensation ($166,000-$244,000 base salary) plus bonus, equity, and comprehensive benefits. You'll be working at Google's Sunnyvale location, contributing to technology that impacts billions of users worldwide. The role requires up to 50% travel to deployment sites, making it ideal for someone who enjoys both technical challenges and hands-on problem-solving in the field.
This is an excellent opportunity for experienced software engineers who are passionate about low-level systems, hardware interfaces, and large-scale infrastructure. You'll be working with cutting-edge AI hardware (TPUs) and contributing to Google's next-generation computing infrastructure. The role offers significant growth potential and the chance to work with some of the most advanced computing systems in the industry.