Site Reliability Engineering (SRE) at Google Cloud combines software and systems engineering to build and maintain large-scale distributed systems. This role focuses on ensuring Google Cloud's services maintain reliability and appropriate uptime while managing capacity and performance. The position involves optimizing existing systems, building infrastructure, and automating processes.
As an SRE III, you'll tackle complex scaling challenges unique to Google Cloud, applying expertise in coding, algorithms, and system design. The role requires managing project priorities and deliverables while designing, developing, and maintaining software solutions. You'll work in a culture that values intellectual curiosity and problem-solving, collaborating with diverse teams in a blame-free environment.
The Technical Infrastructure team is responsible for the architecture supporting Google's product portfolio, from data centers to next-generation platforms. The role offers competitive compensation ($141,000-$202,000 + bonus + equity + benefits) and the opportunity to work with cutting-edge technology at massive scale.
Key responsibilities include code development, peer review, documentation, system troubleshooting, and participating in technical design decisions. The ideal candidate will have experience with distributed systems, strong programming skills, and the ability to analyze complex technical problems. This position offers the chance to work on meaningful projects while receiving support and mentorship for continued growth and learning.