Google's Site Reliability Development team is at the forefront of maintaining and optimizing the large-scale distributed systems that power Google's critical services. This role combines software development expertise with systems engineering to ensure that Google's services meet high reliability and performance standards. As a Software Developer III in Site Reliability Development, you'll tackle unique scaling challenges while working on infrastructure development and automation. The position requires strong coding skills, an understanding of distributed systems, and the ability to solve complex technical problems.
The role offers the opportunity to work with cutting-edge technology and contribute to systems that impact billions of users. You'll join a culture that values intellectual curiosity and collaboration, working alongside diverse teammates in a blame-free environment. The team promotes self-direction while providing support and mentorship for professional growth.
Your responsibilities will include writing and reviewing code, contributing to technical documentation, troubleshooting complex system issues, and participating in design reviews. You'll work with teams across Google to optimize system performance and reliability, and you'll have the chance to devise and implement new solutions to challenging technical problems.
The position offers exposure to Google's vast technical infrastructure and to some of the most sophisticated distributed systems in the industry. You'll be part of a team that values both technical excellence and work-life balance, with opportunities to grow your career and learn from experienced engineers.