Site Reliability Engineering (SRE) at Google Cloud combines software and systems engineering to build and maintain large-scale distributed systems. As an SRE III, you'll be responsible for ensuring the reliability and uptime of Google Cloud's services while managing complex challenges of scale. The role involves optimizing existing systems, building infrastructure, and automating processes.
The position requires expertise in coding, algorithms, complexity analysis, and large-scale system design. You'll work in a culture that values intellectual curiosity, problem-solving, and openness. The team brings together diverse perspectives and backgrounds, encouraging collaboration and risk-taking in a blame-free environment.
You'll be part of the Technical Infrastructure team, which is fundamental to Google's product portfolio. The team manages data centers, develops next-generation platforms, and ensures networks run optimally for the best user experience. This role offers the opportunity to work on meaningful projects while receiving support and mentorship for professional growth.
The position comes with competitive compensation including a base salary range of $141,000-$202,000, plus bonus, equity, and comprehensive benefits. You'll be working alongside experienced engineers, contributing to critical systems that power Google Cloud's infrastructure. The role requires a strong foundation in computer science and practical experience in software development, making it ideal for those looking to impact cloud infrastructure at scale.
This is an excellent opportunity for engineers passionate about distributed systems, automation, and maintaining high-reliability services. You'll be at the forefront of cloud technology, working with cutting-edge systems while ensuring Google Cloud's services meet the highest standards of reliability and performance.