Google's Site Reliability Engineering (SRE) team is seeking a talented engineer to join their Quality of Compute division. This role combines software and systems engineering to build and maintain large-scale, distributed systems that power Google Cloud's services. As an SRE, you'll be responsible for ensuring reliability, uptime, and continuous improvement of both internal and external systems.
The position offers unique challenges in managing complex systems at Google's scale, requiring expertise in coding, algorithms, and large-scale system design. You'll work on optimizing existing systems, building infrastructure, and creating automation solutions. The role involves collaborating with a diverse team in a blame-free environment that encourages intellectual curiosity and innovation.
Key focus areas include developing security capabilities for Google's shared infrastructure, working with distributed systems security, and implementing secure node automation and workload isolation. You'll be part of Google's Technical Infrastructure team, which is fundamental to keeping Google's vast product portfolio running efficiently and securely.
The ideal candidate will have strong programming skills, deep understanding of Unix/Linux systems, and experience with large-scale distributed systems. This role offers the opportunity to work remotely from Poland, contributing to critical infrastructure that impacts billions of users while being part of Google's inclusive and innovative culture.