Site Reliability Engineering (SRE) at Google is an engineering discipline that combines software and systems engineering to build and maintain large-scale, distributed systems. As an SRE, you'll be responsible for ensuring Google's services maintain appropriate reliability and uptime while continuously improving performance and capacity. The role involves creative engineering solutions to operations problems, with a focus on automation and system optimization. You'll work with a diverse team to handle critical infrastructure and externally-visible systems, using a wide range of tools and approaches to solve complex problems. The position emphasizes limiting operational work, conducting blameless postmortems, and proactively identifying potential issues. Google's Technical Infrastructure team provides the foundation for Google's product portfolio, from developing and maintaining data centers to building next-generation platforms. The role offers competitive compensation, comprehensive benefits, and the opportunity to work on some of the world's largest distributed systems. This is an excellent opportunity for engineers who enjoy both software development and systems engineering, with a focus on reliability, scalability, and automation.