Site Reliability Development at Google combines software and systems engineering to build and maintain large-scale, distributed systems. This role focuses on ensuring Google's services maintain high reliability and performance while constantly improving. As a Software Developer III in Site Reliability Development, you'll tackle unique scaling challenges while applying expertise in coding, algorithms, and system design. The role involves both optimizing existing systems and building new infrastructure through automation.
The position requires strong technical skills in distributed systems, coding, and problem-solving. You'll work in a collaborative environment that values intellectual curiosity and creative solutions. The team brings together diverse perspectives and promotes self-direction while providing mentorship and support for growth.
Key responsibilities include writing and reviewing code, contributing to documentation, troubleshooting complex system issues, and participating in technical design decisions. You'll manage project priorities and deliverables while working with cutting-edge technology at massive scale.
This is an excellent opportunity for someone passionate about both software development and systems engineering who wants to work on Google's critical infrastructure. The role offers significant technical challenges, opportunities to learn and grow, and the chance to make a real impact on systems used by billions of users.
The position is based in Waterloo, Canada, and requires strong collaboration skills as you'll work with teams across Google. The culture emphasizes learning, innovation, and taking ownership of projects that improve Google's infrastructure reliability.