Google Cloud is seeking a Staff Software Engineer to join their Machine Learning Infrastructure team within the Core ML organization. This role focuses on optimizing Google's Machine Learning resources, particularly working with TPUs and GPUs used across all Google products. The position requires extensive experience in software development, machine learning, and system architecture. As part of the ML, Systems, & Cloud AI (MSCA) organization, you'll be responsible for developing monitoring tools and dashboards to track performance and efficiency of ML resources, identifying areas for improvement, and driving efficiency gains across Google's products. The role involves working with cutting-edge technologies including TensorFlow, Kubernetes, and Google's custom TPUs, while contributing to Google Cloud's Vertex AI platform. This is an opportunity to impact billions of users while working on next-generation technologies in areas such as distributed computing, large-scale system design, and artificial intelligence. The position offers the chance to work with advanced ML infrastructure, lead junior engineers, and collaborate across different teams to improve Google's ML fleet efficiency. The ideal candidate will combine technical expertise in ML systems with leadership abilities and a drive for innovation.