Google's Site Reliability Engineering (SRE) team combines software and systems engineering to build and maintain large-scale, distributed systems. As an SRE at Google, you'll be responsible for ensuring the reliability and performance of Google's critical internal and external systems. The role involves creative problem-solving, automation, and system optimization.
The position requires expertise in software development, distributed systems, and technical leadership. You'll work on designing, building, and maintaining efficient large-scale systems, with a focus on reliability, uptime, capacity, and performance. The SRE team values diversity, intellectual curiosity, and a blame-free environment for problem-solving.
As a Senior SRE, you'll be involved in the entire service lifecycle, from design to deployment and maintenance. You'll contribute to system design, develop software platforms, conduct capacity planning, and perform launch reviews. The role involves monitoring system health, implementing automation, and participating in incident response.
Google offers a competitive compensation package, including a base salary range of $166,000-$244,000, plus bonus, equity, and comprehensive benefits. The company is committed to diversity and inclusion, providing equal opportunities regardless of background.
The Technical Infrastructure team, which includes SRE, is fundamental to Google's operations, maintaining the architecture that powers Google's extensive product portfolio. This role offers the opportunity to work on some of the world's largest distributed systems while collaborating with talented engineers in a supportive, growth-oriented environment.