Site Reliability Engineering (SRE) at Google Cloud combines software and systems engineering expertise to build and maintain large-scale, distributed systems. This senior role focuses on ensuring reliability and optimal performance of Google Cloud's critical services. You'll work on complex challenges unique to Google's scale, applying your expertise in coding, algorithms, and system design.
The position requires strong technical leadership and hands-on development experience, particularly in distributed systems and service reliability. You'll be responsible for the entire service lifecycle, from design and implementation to deployment and maintenance. The role involves both proactive system improvements and reactive incident response.
As part of Google's Technical Infrastructure team, you'll help build and maintain the architecture that powers Google's vast product portfolio. The team emphasizes intellectual curiosity, problem-solving, and collaboration in a blame-free environment. You'll work with diverse colleagues from various backgrounds and perspectives, contributing to Google Cloud's infrastructure evolution.
The compensation package is competitive, ranging from $166,000 to $244,000 base salary, plus bonus, equity, and comprehensive benefits. This is an excellent opportunity for experienced engineers who want to impact billions of users while working with cutting-edge technology at massive scale.
The role offers the chance to work in several major tech hubs, including Sunnyvale, Waterloo, Seattle, New York, and San Francisco, providing flexibility in location while maintaining close collaboration with global teams. Google's culture promotes self-direction on meaningful projects while providing strong support and mentorship for continued learning and growth.