Google's Site Reliability Engineering (SRE) team is seeking a Staff Software Engineer to join its mission of building and maintaining large-scale, massively distributed, fault-tolerant systems. The role combines software and systems engineering to keep Google Cloud's services reliable and performing well.
As a Staff SRE, you'll tackle complex challenges unique to Google Cloud's scale. Your work will focus on optimizing existing systems, building infrastructure, and implementing automation to eliminate manual work. The role requires expertise in coding, algorithms, complexity analysis, and large-scale system design.
The position is based in Munich, Germany, where you'll join Google's Technical Infrastructure team. You'll be responsible for the architecture that powers Google's product portfolio, from developing and maintaining data centers to building next-generation Google platforms. The role spans the entire service lifecycle, from design and deployment to operation and refinement.
Key aspects of the role include system design consulting, capacity planning, launch reviews, and maintaining service health through monitoring and metrics. You'll work in a culture that values intellectual curiosity, problem-solving, and openness, collaborating with professionals from diverse backgrounds and perspectives.
This is an ideal opportunity for an experienced engineer who wants to impact billions of users while working with cutting-edge distributed systems technology. The role offers the chance to grow professionally in a supportive environment that encourages self-direction while providing mentorship and learning opportunities.