Google is seeking a Site Reliability Engineer to join their Data Movement Platform team. This role combines software and systems engineering to build and maintain large-scale, distributed systems. As an SRE, you'll be responsible for ensuring Google's services maintain high reliability and appropriate uptime while continuously improving performance and capacity.
The position requires expertise in coding, algorithms, and large-scale system design. You'll work on optimizing existing systems, building infrastructure, and automating processes to eliminate manual work. The role involves managing complex challenges unique to Google's scale while applying technical knowledge in programming, system administration, and networking.
The SRE team values intellectual curiosity, problem-solving, and openness. You'll join a diverse group of professionals with various backgrounds and perspectives, working in a blame-free environment that encourages collaboration, big thinking, and calculated risk-taking. The team provides strong support and mentorship for continuous learning and growth.
As part of the Technical Infrastructure team, you'll help maintain the architecture that powers Google's product portfolio. This includes everything from data centers to next-generation platforms, ensuring users have the best possible experience. The role involves hands-on work with systems, monitoring, troubleshooting, and implementing improvements to enhance reliability and efficiency.
This is an excellent opportunity for someone passionate about large-scale systems, automation, and maintaining high-reliability services. You'll have the chance to work on meaningful projects while contributing to the infrastructure that powers one of the world's largest technology companies.