Site Reliability Engineering (SRE) at Google Cloud combines software and systems engineering to build and maintain large-scale distributed systems. As a Senior SRE, you'll be responsible for ensuring the reliability and performance of Google Cloud's critical services. The role involves complex problem-solving, automation, and system optimization at massive scale.
The position requires strong technical expertise in distributed systems, coding, and system design. You'll work on optimizing existing systems, building infrastructure, and automating processes. The role offers opportunities to tackle unique scaling challenges specific to Google Cloud while collaborating with a diverse team of engineers.
The SRE team values intellectual curiosity, problem-solving, and open collaboration. You'll work in a blame-free environment that encourages innovation and risk-taking. The position offers competitive compensation ($166,000-$244,000 base salary plus benefits) and the chance to work with cutting-edge technology.
Key responsibilities include service lifecycle management, system design, capacity planning, monitoring system health, and implementing automation. You'll also participate in incident response and contribute to improving system reliability and performance. The role requires both technical depth and leadership skills, as you'll be guiding projects and providing technical direction.
This is an excellent opportunity for experienced engineers who want to work on some of the world's largest distributed systems while contributing to Google Cloud's infrastructure and reliability.