Site Reliability Engineering (SRE) at Google Cloud combines software and systems engineering expertise to build and maintain large-scale, distributed systems. This senior role focuses on ensuring reliability and optimal performance of Google Cloud's critical services. You'll work on complex challenges unique to Google's scale, applying your expertise in coding, algorithms, and system design. The position offers opportunities to optimize existing systems, build infrastructure, and automate processes.
The role is part of Google's Technical Infrastructure team, which is fundamental to Google's product portfolio. You'll be involved in the complete lifecycle of services, from design to deployment and refinement. Key responsibilities include system design consulting, capacity planning, monitoring system health, and implementing automation for scalability.
The culture emphasizes intellectual curiosity, problem-solving, and collaboration in a blame-free environment. Google values diversity of perspectives and backgrounds, providing support and mentorship for professional growth. The position offers competitive compensation including base salary, bonus, equity, and comprehensive benefits.
This is an ideal opportunity for experienced engineers passionate about reliability, scalability, and system design who want to make an impact at global scale. You'll work with cutting-edge technology while contributing to systems that serve billions of users.