Google's Site Reliability Engineering (SRE) team is seeking a Tech Lead, Senior Site Reliability Engineer to join their Technical Infrastructure organization. This role combines software and systems engineering to build and maintain Google's large-scale, distributed systems. As an SRE Tech Lead, you'll be responsible for ensuring Google's services maintain appropriate reliability and performance while driving continuous improvement.
The role requires deep technical expertise in distributed systems, with a focus on designing and troubleshooting large-scale infrastructure. You'll lead projects and provide technical direction while working on critical systems that power Google's vast product portfolio. The position involves both hands-on engineering work and technical leadership responsibilities.
SRE at Google emphasizes automation, system optimization, and eliminating manual operational work. You'll use a wide range of tools and approaches to solve complex problems, while promoting practices like blameless postmortems and proactive outage prevention. The team culture values intellectual curiosity, creative problem-solving, and open collaboration.
The Technical Infrastructure team builds and maintains the foundation that makes Google's products possible, from data centers to next-generation platforms. This role offers the opportunity to work on challenging technical problems at massive scale while leading and mentoring other engineers. You'll be part of an organization that takes pride in engineering excellence and innovative solutions.
The ideal candidate combines strong technical skills in distributed systems and software development with proven leadership experience. You'll need excellent problem-solving abilities and communication skills to succeed in this role that bridges hands-on engineering with technical leadership. This is an opportunity to make a significant impact on the reliability and scalability of Google's global infrastructure.