Google is seeking a Senior Software Engineer for their Site Reliability Engineering (SRE) team, combining software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. This role is crucial in ensuring Google Cloud's services maintain reliability and appropriate uptime while monitoring system capacity and performance.
The position involves working with complex challenges unique to Google Cloud's scale, requiring expertise in coding, algorithms, complexity analysis, and large-scale system design. The SRE team values intellectual curiosity, problem-solving, and openness, bringing together diverse perspectives in a blame-free environment that encourages collaboration and risk-taking.
As part of the Technical Infrastructure team, you'll be instrumental in developing and maintaining Google's data centers and building next-generation platforms. The role involves the entire service lifecycle, from design and deployment to operation and refinement. You'll work on optimizing existing systems, building infrastructure, and automating processes to eliminate manual work.
The ideal candidate will have strong experience in distributed systems, demonstrated leadership abilities, and excellent problem-solving skills. This position offers the opportunity to work on some of the world's largest computing systems while contributing to Google's core infrastructure that powers its vast product portfolio.
Working in Munich, Germany, you'll be part of a global team that's essential to keeping Google's networks running efficiently and ensuring users have the best possible experience. The role combines technical expertise with strategic thinking, requiring both hands-on engineering skills and the ability to provide technical leadership.