Taro Logo

Site Reliability Engineer, Spanner

Google is a global technology company that builds innovative products and services used by billions of users.
Site Reliability
Mid-Level Software Engineer
In-Person
5,000+ Employees
2+ years of experience
Enterprise SaaS
This job posting may no longer be active. You may be interested in these related jobs instead:

Description For Site Reliability Engineer, Spanner

Google's Site Reliability Engineering (SRE) team is seeking a talented engineer to join their Spanner team. As an SRE, you'll combine software and systems engineering to build and maintain large-scale, massively distributed, fault-tolerant systems. This role focuses on ensuring Google's services maintain reliability and appropriate uptime while continuously improving performance.

The position offers unique challenges of scale specific to Google's infrastructure. You'll work on optimizing existing systems, building infrastructure, and creating automation solutions. The role requires expertise in coding, algorithms, complexity analysis, and large-scale system design. You'll be part of a team that manages the complex infrastructure behind Google's vast service portfolio.

SRE at Google promotes a culture of intellectual curiosity, problem-solving, and openness. The organization brings together diverse perspectives and backgrounds, encouraging collaboration and risk-taking in a blame-free environment. You'll have the opportunity to work on meaningful projects while receiving support and mentorship for professional growth.

The Technical Infrastructure team, which includes SRE, is fundamental to Google's operations, developing and maintaining data centers and building next-generation platforms. The team takes pride in being the engineers' engineers, focusing on keeping networks running optimally to ensure the best possible user experience.

This is an excellent opportunity for someone passionate about distributed systems, who enjoys solving complex technical challenges, and wants to work with cutting-edge technology at a global scale. The role offers the chance to impact billions of users while working with some of the industry's brightest minds in system reliability and infrastructure.

Last updated 3 months ago

Responsibilities For Site Reliability Engineer, Spanner

  • Work on product/tool development supporting software and infrastructure tools
  • Create, influence and review ongoing design, architecture, standards and methods for services and systems
  • Participate in service capacity planning, software performance analysis and system tuning
  • Manage availability, latency, scalability and efficiency of Google services by engineering reliability into software and systems
  • Respond to and resolve emergent service problems; write software and build automation to prevent problem recurrence

Requirements For Site Reliability Engineer, Spanner

Python
Java
Go
  • Bachelor's degree in Computer Science, or a related technical field, or equivalent practical experience
  • 2 years of experience with data structures/algorithms and software development in one or more programming languages (e.g., Python, C++, Java)
  • 4 years of experience with data structures/algorithms and software development in one or more programming languages (e.g., C, C++, Java, Python or Go) (preferred)
  • Experience in analyzing and troubleshooting distributed systems, cloud computing, and databases, Incident Management, Coding (preferred)