Google's Site Reliability Engineering (SRE) team is seeking a Senior Software Engineer to help build and maintain large-scale, massively distributed, fault-tolerant systems. This role combines software and systems engineering expertise to keep Google Cloud's services reliable and performant.
As an SRE at Google, you'll work on complex challenges unique to Google's scale, focusing on optimizing existing systems, building infrastructure, and implementing automation. The role requires expertise in coding, algorithms, complexity analysis, and large-scale system design. You'll be part of a culture that values intellectual curiosity, problem-solving, and openness.
The Technical Infrastructure team is fundamental to Google's product portfolio, developing and maintaining data centers and building next-generation platforms. Your responsibilities will span the entire service lifecycle, from design and deployment to operation and refinement. You'll be involved in system design consulting, capacity planning, launch reviews, and maintaining service health through monitoring and automation.
This position offers the opportunity to work with cutting-edge technology at massive scale, collaborate with talented engineers, and directly impact billions of users. The role combines technical leadership with hands-on engineering, requiring both strategic thinking and practical implementation skills.
Google offers a collaborative environment that promotes self-direction while providing support and mentorship for growth. You'll be part of a diverse team that brings together people with various backgrounds and perspectives, working in a blame-free culture that encourages innovation and risk-taking.