Google Cloud's Site Reliability Engineering (SRE) team is seeking a Staff Software Engineer to join their mission of building and running large-scale, massively distributed, fault-tolerant systems. This role combines software and systems engineering expertise to ensure Google Cloud's services maintain reliability and uptime while continuously improving.
As a Staff SRE, you'll tackle complex challenges of scale unique to Google Cloud, applying your expertise in coding, algorithms, complexity analysis, and large-scale system design. You'll be responsible for the entire service lifecycle, from design and deployment to operation and refinement. The role involves system design consulting, developing software platforms, capacity planning, and launch reviews.
The position offers competitive compensation ($189,000-$284,000 + bonus + equity + benefits) and the opportunity to work with a diverse team of intellectually curious problem-solvers. You'll contribute to maintaining Google's vast technical infrastructure, ensuring users have the best and fastest experience possible.
The ideal candidate brings 8+ years of experience with data structures and algorithms, strong software development skills, and proven leadership experience in distributed systems. You'll work in a blame-free environment that encourages collaboration, big thinking, and risk-taking, with ample support and mentorship for continued learning and growth.
Join Google's SRE team to help shape the future of cloud infrastructure while working on meaningful projects that impact billions of users. Your work will directly contribute to the reliability and performance of Google Cloud's critical systems, making you an essential part of Google's engineering excellence.