Site Reliability Engineering (SRE) at Google Cloud combines software and systems engineering to build and maintain large-scale distributed systems. This senior role focuses on ensuring reliability and uptime for Google Cloud's services while managing complex scaling challenges. The position requires expertise in coding, algorithms, and system design, with responsibilities spanning the entire service lifecycle from design to deployment and maintenance.
The role involves working with cutting-edge infrastructure, automating processes, and solving unique scaling challenges. You'll be part of Google's Technical Infrastructure team, the team responsible for keeping networks running efficiently and maintaining data centers. The culture emphasizes intellectual curiosity, problem-solving, and collaboration in a blame-free environment.
As a Senior SRE, you'll lead projects, provide technical leadership, and work on meaningful initiatives that directly impact Google Cloud's infrastructure. The position offers competitive compensation (a base salary of $166,000-$244,000 plus benefits) and multiple location options, including Sunnyvale, Waterloo, Seattle, New York, and San Francisco.
The ideal candidate has strong experience in software development, distributed systems, and technical leadership. You'll work with a diverse team, focusing on system optimization, automation, and maintaining high reliability standards. This role offers an excellent opportunity to work on large-scale systems while contributing to Google Cloud's infrastructure development.