Google's Site Reliability Engineering (SRE) team is seeking a talented engineer to join their Spanner team. As an SRE, you'll combine software and systems engineering to build and maintain large-scale, massively distributed, fault-tolerant systems. This role focuses on ensuring Google's services maintain reliability and appropriate uptime while continuously improving performance.
The position offers unique challenges of scale specific to Google's infrastructure. You'll work on optimizing existing systems, building infrastructure, and creating automation solutions. The role requires expertise in coding, algorithms, complexity analysis, and large-scale system design. You'll be part of a team that manages the complex infrastructure behind Google's vast service portfolio.
SRE at Google promotes a culture of intellectual curiosity, problem-solving, and openness. The organization brings together diverse perspectives and backgrounds, encouraging collaboration and risk-taking in a blame-free environment. You'll have the opportunity to work on meaningful projects while receiving support and mentorship for professional growth.
The Technical Infrastructure team, which includes SRE, is fundamental to Google's operations, developing and maintaining data centers and building next-generation platforms. The team takes pride in being the engineers' engineers, focusing on keeping networks running optimally to ensure the best possible user experience.
This is an excellent opportunity for someone passionate about distributed systems, who enjoys solving complex technical challenges, and wants to work with cutting-edge technology at a global scale. The role offers the chance to impact billions of users while working with some of the industry's brightest minds in system reliability and infrastructure.