Site Reliability Development at Google combines software and systems development to build and run large-scale, massively distributed, fault-tolerant systems. As a Site Reliability Developer II, you'll be responsible for ensuring Google's services maintain reliability and appropriate uptime while continuously improving performance. The role involves managing complex challenges unique to Google's scale, utilizing expertise in coding, algorithms, complexity analysis, and large-scale system design.
The position offers opportunities to work on meaningful projects in a blame-free environment that encourages intellectual curiosity and problem-solving. You'll be part of a diverse team that brings together people with various backgrounds and perspectives. The role involves writing system development code, reviewing other developers' code, contributing to documentation, troubleshooting complex issues, and participating in technical design reviews.
This is an excellent opportunity for someone with software development experience who wants to focus on building and maintaining highly reliable, scalable systems. The role offers competitive compensation including base salary, bonus, equity, and benefits. You'll work alongside talented engineers while having the support and mentorship needed to learn and grow in your career.
The position requires strong technical skills in software development, data structures, and algorithms, with opportunities to work on automation, optimization, and infrastructure development. You'll be part of Google's critical technical infrastructure team, ensuring the reliability and performance of both internal and external-facing systems.