Google Cloud's Site Reliability Engineering (SRE) team is seeking a Software Engineer II to help build and maintain large-scale distributed systems. This role combines software and systems engineering to keep Google Cloud's services reliable and performant. You'll work on complex challenges unique to Google's scale, focusing on system optimization, infrastructure development, and automation. The position requires strong coding skills, an understanding of algorithms, and system design capabilities.
As an SRE, you'll be responsible for managing critical internal and external systems, ensuring appropriate uptime and continuous improvement. The role involves writing code, reviewing peers' work, contributing to documentation, troubleshooting complex system issues, and participating in technical design decisions. You'll work in a culture that values intellectual curiosity, problem-solving, and openness.
This is an excellent opportunity for engineers who want to work on meaningful projects while receiving support and mentorship for professional growth. The team promotes self-direction and risk-taking in a blame-free environment, bringing together people with diverse backgrounds and perspectives. You'll be part of Google Cloud's mission to build and maintain some of the world's most sophisticated distributed systems.
The ideal candidate has software development experience, a strong understanding of data structures and algorithms, and the ability to manage project priorities effectively. This role offers the chance to work with cutting-edge technology while contributing to systems that impact millions of users globally.