Google Cloud's Site Reliability Engineering (SRE) team is seeking a Staff Software Engineer to help build and maintain large-scale, distributed systems. This role combines software and systems engineering to ensure Google Cloud's services maintain optimal reliability and performance. You'll be working on complex challenges unique to Google's scale, focusing on system optimization, infrastructure development, and automation. The position requires expertise in coding, algorithms, and large-scale system design.
As a Staff SRE, you'll be responsible for the entire service lifecycle, from design through deployment and ongoing operations. You'll contribute to system design, develop software platforms, plan capacity, and conduct launch reviews. The role involves monitoring system health, implementing automation for scale, and leading incident response.
The ideal candidate brings 8+ years of software development experience and 3+ years leading projects and working with distributed systems. You'll join a culture that values intellectual curiosity, problem-solving, and collaboration in a blame-free environment. Google offers competitive compensation including a base salary range of $197,000-$291,000 plus bonus, equity, and comprehensive benefits.
This is an opportunity to work at the forefront of cloud technology, solving unique technical challenges while ensuring reliability for Google Cloud's massive user base. You'll be part of Google's Technical Infrastructure team, which builds and maintains the foundation for Google's entire product portfolio. The role offers significant technical challenges, leadership opportunities, and the chance to impact millions of users worldwide.