Site Reliability Engineering (SRE) at Google is an engineering discipline that combines software and systems engineering to build and maintain large-scale distributed systems. As a Tech Lead Senior SRE, you'll be responsible for ensuring Google's services maintain appropriate reliability and performance while driving continuous improvement. The role involves creative engineering solutions to operations problems, with a focus on optimizing systems, building infrastructure, and automation.
You'll work within Google's Technical Infrastructure team, which is fundamental to Google's product portfolio. The team develops and maintains data centers, builds next-generation platforms, and ensures networks run optimally for the best user experience. The position requires expertise in distributed systems, strong programming skills, and leadership ability.
The role offers an opportunity to work on some of the world's largest computing systems while promoting a culture of intellectual curiosity and blameless problem-solving. Google emphasizes self-direction on meaningful projects while providing support and mentorship for growth. You'll be part of an organization that brings together diverse perspectives and backgrounds, encouraging collaboration and innovative thinking in a blame-free environment.
As a Tech Lead, you'll guide projects and provide technical leadership while working hands-on with complex distributed systems. The position involves both strategic planning and practical implementation, from system design to production maintenance. You'll need to balance technical depth with leadership skills, guiding teams while maintaining technical excellence.
This role is ideal for experienced engineers who are passionate about large-scale systems, automation, and technical leadership. You'll have the opportunity to shape how Google's critical infrastructure evolves while working with cutting-edge technology and talented engineers from diverse backgrounds.