Google's Site Reliability Engineering (SRE) team is at the forefront of maintaining and optimizing large-scale, distributed systems that power Google Cloud's services. This role combines software and systems engineering to ensure the reliability and performance of both internal and customer-facing systems. As an SRE focusing on Borg Node, you'll be working with Google's container orchestration system, tackling unique scaling challenges while applying your expertise in coding, algorithms, and system design.
The position offers an opportunity to work in a culture that values intellectual curiosity and problem-solving, where you'll collaborate with diverse teammates in a blame-free environment. You'll be involved in optimizing existing systems, building infrastructure, and creating automation solutions to eliminate manual work. The role requires both technical expertise and the ability to think strategically about system reliability and performance.
The team promotes self-direction while providing strong support and mentorship for professional growth. You'll be working on meaningful projects that directly impact Google's infrastructure reliability while having the chance to learn and grow in a collaborative environment. This role is perfect for someone who is passionate about both software engineering and systems operations, and wants to work on technology at a massive scale.
The position offers the opportunity to work with cutting-edge technology while being part of a team that ensures Google's services maintain their world-class reliability. You'll be joining a company known for its innovative approach to technology and strong engineering culture, with excellent opportunities for career growth and development.