Taro Logo

Site Reliability Engineer, Quality of Compute

Google is a global technology company that builds innovative products and services used by billions of users.
Site Reliability
Mid-Level Software Engineer
Remote
5,000+ Employees
2+ years of experience
Enterprise SaaS · Cybersecurity

Description For Site Reliability Engineer, Quality of Compute

Google's Site Reliability Engineering (SRE) team is seeking a talented engineer to join their Quality of Compute division. This role combines software and systems engineering to build and maintain large-scale, distributed systems that power Google Cloud's services. As an SRE, you'll be responsible for ensuring reliability, uptime, and continuous improvement of both internal and external systems.

The position offers unique challenges in managing complex systems at Google's scale, requiring expertise in coding, algorithms, and large-scale system design. You'll work on optimizing existing systems, building infrastructure, and creating automation solutions. The role involves collaborating with a diverse team in a blame-free environment that encourages intellectual curiosity and innovation.

Key focus areas include developing security capabilities for Google's shared infrastructure, working with distributed systems security, and implementing secure node automation and workload isolation. You'll be part of Google's Technical Infrastructure team, which is fundamental to keeping Google's vast product portfolio running efficiently and securely.

The ideal candidate will have strong programming skills, deep understanding of Unix/Linux systems, and experience with large-scale distributed systems. This role offers the opportunity to work remotely from Poland, contributing to critical infrastructure that impacts billions of users while being part of Google's inclusive and innovative culture.

Last updated 10 days ago

Responsibilities For Site Reliability Engineer, Quality of Compute

  • Design and develop security capabilities to protect Google's shared infrastructure at the node, cluster, campus, and global scales
  • Gain expertise in securing distributed systems, encompassing areas such as secure node automation, workload isolation, and privilege escalation prevention
  • Collaborate with SREs and Development teams to establish a security posture against evolving threats

Requirements For Site Reliability Engineer, Quality of Compute

Linux
Python
Go
Java
  • Bachelor's degree in Computer Science, a related field, or equivalent practical experience
  • 2 years of experience with programming in one or more programming languages
  • 2 years of experience working with Unix/Linux systems internals and administration or networking
  • Experience in designing, analyzing, and troubleshooting large-scale distributed systems
  • Understanding of Unix/Linux operating system internals, computer architecture, networking protocols
  • Ability to debug, optimize code, and automate tasks
  • Excellent problem-solving and communication skills

Benefits For Site Reliability Engineer, Quality of Compute

Medical Insurance
Dental Insurance
Vision Insurance
401k
Parental Leave
  • Comprehensive health benefits
  • Retirement plans
  • Parental leave
  • Remote work options

Interested in this job?

Jobs Related To Google Site Reliability Engineer, Quality of Compute