Google's Site Reliability Engineering (SRE) team is looking for a Senior Software Engineer to join its mission of building and running large-scale, massively distributed, fault-tolerant systems. This role combines software and systems engineering to ensure Google Cloud's services maintain reliability and appropriate uptime while continuously improving performance.
As an SRE, you'll work on optimizing existing systems, building infrastructure, and automating processes. The role presents challenges of scale unique to Google Cloud and requires expertise in coding, algorithms, complexity analysis, and large-scale system design. You'll be part of a culture that values intellectual curiosity, problem-solving, and openness, working alongside people with diverse backgrounds and perspectives.
The Technical Infrastructure team is central to maintaining Google's architecture, from developing and maintaining data centers to building next-generation Google platforms. The role involves keeping networks running optimally to give users the best possible experience.
This position offers the opportunity to work with cutting-edge technology at massive scale, contribute to critical infrastructure, and be part of a team that values continuous learning and innovation. The ideal candidate will combine technical expertise with leadership skills to drive improvements in system reliability and performance.
Working at Google also means being part of a company committed to diversity, equality, and creating a culture of belonging. The role offers the chance to make a significant impact on systems used by billions of users while working with some of the industry's brightest minds in distributed systems and reliability engineering.