Site Reliability Engineering (SRE) at Google Cloud combines software and systems engineering expertise to build and maintain large-scale, distributed systems. As a Technical Lead in SRE, you'll be responsible for ensuring the reliability and performance of Google Cloud's critical services, both internal and customer-facing. The role involves complex system design, automation, and optimization work unique to Google's scale.
The position requires deep technical expertise in distributed systems, with a focus on reliability, scalability, and performance optimization. You'll lead projects and collaborate with teams across Google to design and implement solutions that maintain and improve service reliability. The role combines hands-on technical work with technical leadership responsibilities.
Working in Google's Technical Infrastructure team, you'll be part of the organization that builds and maintains the foundation of Google's product portfolio. The team takes pride in solving complex engineering challenges and building next-generation platforms. The culture emphasizes intellectual curiosity, problem-solving, and collaboration in a blame-free environment.
This is an excellent opportunity for experienced engineers who want to work on challenging distributed systems problems at massive scale while providing technical leadership. The role offers the chance to work with cutting-edge technology and contribute to the reliability of services used by millions of users worldwide. Google provides a supportive environment for learning and growth, with opportunities to collaborate with some of the best engineers in the industry.