Site Reliability Engineering (SRE) at Google Cloud combines software and systems engineering to build and maintain large-scale, distributed systems. As an SRE III, you'll be responsible for ensuring the reliability and uptime of Google Cloud's services, both internal and customer-facing systems. The role involves optimizing existing systems, building infrastructure, and automating processes to eliminate manual work.
The position offers unique challenges of scale specific to Google Cloud, where you'll apply your expertise in coding, algorithms, complexity analysis, and large-scale system design. You'll be part of a diverse team that values intellectual curiosity, problem-solving, and openness. The culture promotes self-direction while providing support and mentorship for growth and learning.
Working in the Technical Infrastructure team, you'll be part of the backbone that makes Google's product portfolio possible. The team is responsible for developing and maintaining data centers, building next-generation Google platforms, and ensuring networks run optimally for the best user experience.
This role combines technical expertise with project management, requiring you to handle priorities, deadlines, and deliverables while designing, developing, testing, deploying, and enhancing software solutions. You'll work in a collaborative environment where code reviews, design discussions, and continuous improvement are part of daily activities.
The position offers competitive compensation including base salary, bonus, equity, and comprehensive benefits. Google promotes an inclusive workplace, committed to equal opportunity and building a representative workforce. Join a team that takes pride in being the engineers' engineers, working on challenging problems at massive scale.