Google Cloud's Site Reliability Engineering (SRE) team is seeking a Staff Systems Engineer to lead and maintain their large-scale distributed systems. This role combines software and systems engineering to ensure Google Cloud's services maintain reliability and optimal performance. The position involves leading major software component designs, managing incident responses, and mentoring team members.
As a Staff SRE, you'll work on optimizing existing systems, building infrastructure, and automating processes. The role requires expertise in coding, algorithms, complexity analysis, and large-scale system design. You'll be part of Google's Technical Infrastructure team, responsible for keeping the networks running and ensuring users have the best experience possible.
The ideal candidate should have extensive experience in troubleshooting distributed systems, strong programming skills, and proven leadership abilities. You'll work in an environment that promotes intellectual curiosity, problem-solving, and openness, while collaborating with people from diverse backgrounds and perspectives.
This role offers the opportunity to work on unique scaling challenges specific to Google Cloud, contribute to critical infrastructure, and make a significant impact on Google's product portfolio. You'll be responsible for maintaining and improving the reliability, scalability, and efficiency of Google's services while leading and mentoring other team members.
The position is based in Dublin, Ireland, and requires working with Google's global teams to ensure efficient collaboration and communication. This is an excellent opportunity for experienced engineers looking to take on a leadership role in one of the world's leading tech companies while working on cutting-edge cloud infrastructure.