Site Reliability Engineering (SRE) at Google Cloud combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. As a Staff SRE, you'll ensure Google Cloud's services maintain reliability and appropriate uptime while monitoring system capacity and performance. The role focuses on optimizing existing systems, building infrastructure, and automation. You'll tackle unique scaling challenges specific to Google Cloud, applying expertise in coding, algorithms, and large-scale system design. The Technical Infrastructure team builds and maintains the architecture behind Google's products, from data centers to next-generation platforms. The team emphasizes diversity, intellectual curiosity, and problem-solving in a blame-free environment. You'll work with professionals from diverse backgrounds, collaborating on meaningful projects while receiving support and mentorship for growth. The role offers opportunities to lead major system components, guide team members, and drive improvements that create business value. Join a culture that values innovation, collaboration, and technical excellence in building the infrastructure that powers Google's global services.