Google's Site Reliability Development team is seeking a Software Developer III in Waterloo. This role combines software and systems development to build and run large-scale, massively distributed, fault-tolerant systems. The position focuses on keeping Google's services reliable, with appropriate uptime, while continuously improving performance.
As a Site Reliability Developer, you'll tackle complex challenges unique to Google's scale, applying expertise in coding, algorithms, complexity analysis, and large-scale system design. The role involves optimizing existing systems, building infrastructure, and automating processes to eliminate manual work.
The team culture emphasizes intellectual curiosity, problem-solving, and openness. Google brings together people with diverse backgrounds and perspectives, encouraging collaboration and risk-taking in a blame-free environment. The organization promotes self-direction while providing support and mentorship for professional growth.
Key responsibilities include writing code for systems development, reviewing other developers' code, contributing to documentation, troubleshooting complex issues, and participating in technical design reviews. The ideal candidate will have experience with distributed systems, strong programming skills, and the ability to manage project priorities effectively.
This is an excellent opportunity for someone passionate about large-scale systems engineering who wants to work on some of the world's most complex technical challenges while being part of a collaborative and innovative team at Google.