Site Reliability Engineering (SRE) at Google Cloud combines software and systems engineering to build and maintain large-scale distributed systems. As an SRE III, you'll be responsible for ensuring the reliability and uptime of Google Cloud's services while managing the complex challenges of scale. The role involves optimizing existing systems, building infrastructure, and automating processes.
The position requires expertise in coding, algorithms, complexity analysis, and large-scale system design. You'll work in a culture that values intellectual curiosity, problem-solving, and openness, bringing together people with diverse backgrounds and perspectives. The team promotes self-direction while providing support and mentorship for growth.
Working on Google's Technical Infrastructure team, you'll be part of the backbone that keeps Google's product portfolio running. The role involves managing project priorities, deadlines, and deliverables, as well as designing, developing, testing, deploying, and maintaining software solutions.
This is an excellent opportunity for experienced engineers who want to work on massively distributed, fault-tolerant systems while having a direct impact on Google Cloud's infrastructure. The position offers competitive compensation ($141,000-$202,000 plus bonus, equity, and benefits) and the chance to work with cutting-edge technology at one of the world's leading tech companies.
The role requires strong technical skills, collaboration abilities, and a passion for system reliability. You'll be part of a team that's proud to be "engineers' engineers" and focuses on maintaining and improving Google's vast technical infrastructure.