Google is seeking a Staff Software Engineer for their Site Reliability Engineering (SRE) team, focusing on Production Scopes. This role combines software and systems engineering to build and maintain large-scale, distributed systems that power Google Cloud's services. The position requires expertise in distributed systems design, troubleshooting, and leadership capabilities.
The role involves leading API designs for failure domain management, developing pipelines and solvers, and providing technical guidance to engineering teams. You'll be working with Google's infrastructure at scale, ensuring reliability and performance of both internal and external systems. The position offers significant technical challenges in distributed systems and automation.
As a Staff SRE, you'll collaborate with global teams, lead project execution, and participate in on-call rotations. The role requires strong programming skills, particularly in Go, and experience with cloud platforms. You'll be working on optimizing existing systems, building infrastructure, and automating processes to improve efficiency.
Google offers a competitive compensation package ranging from $197,000 to $291,000 base salary, plus bonus, equity, and comprehensive benefits. The company promotes a culture of intellectual curiosity and problem-solving, encouraging collaboration and innovation in a blame-free environment.
This is an excellent opportunity for experienced engineers who want to work on complex technical challenges at scale while providing technical leadership. The role combines hands-on technical work with strategic thinking and team leadership, making it ideal for those who want to impact Google's infrastructure while growing their career in technical leadership.