Site Reliability Development at Google combines software and systems development to build and run large-scale, massively distributed, fault-tolerant systems. As a Senior Software Developer in SRE, you'll be responsible for ensuring Google's services maintain reliability and appropriate uptime while continuously improving performance and capacity. The role involves managing complex challenges unique to Google's scale, utilizing expertise in coding, algorithms, complexity analysis, and large-scale system design.
The position offers opportunities to work on meaningful projects with significant impact, collaborating with diverse teams in a blame-free environment that encourages intellectual curiosity and problem-solving. You'll be part of the Technical Infrastructure team, which is fundamental to Google's product portfolio, developing and maintaining data centers and building next-generation Google platforms.
The role combines hands-on technical work with technical leadership, requiring both deep technical expertise and project leadership skills. You'll be involved in the entire service lifecycle, from design through deployment and operation, while also focusing on automation and system evolution to improve reliability and velocity.
Working at Google provides competitive compensation, comprehensive benefits, and the chance to work on some of the world's most complex and impactful technical challenges. The company's culture promotes self-direction while providing support and mentorship for continuous learning and growth. This role is perfect for experienced engineers who are passionate about system reliability, automation, and building scalable solutions.