Taro Logo

Senior Software Engineer, SRE, Cloud Incident Response

Google is a global technology company that builds innovative products and services used by billions of users.
Site Reliability
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
Enterprise SaaS · Cloud

Description For Senior Software Engineer, SRE, Cloud Incident Response

Google is seeking a Senior Software Engineer to join their Site Reliability Engineering (SRE) team, focusing on Cloud Incident Response. This role combines software and systems engineering to build and maintain large-scale, distributed systems for Google Cloud Platform. The position requires expertise in distributed systems, incident management, and software development.

As an SRE, you'll be responsible for ensuring the reliability and uptime of Google Cloud's services, both internal and customer-facing systems. The role involves optimizing existing systems, building infrastructure, and automating processes to improve efficiency and reliability. You'll work on complex challenges unique to Google Cloud's scale while applying your expertise in coding, algorithms, and system design.

The position offers the opportunity to work in a culture that values intellectual curiosity and problem-solving. You'll be part of an organization that brings together diverse perspectives and encourages collaboration in a blame-free environment. The role involves both independent work on meaningful projects and collaborative efforts with supportive mentorship.

Key responsibilities include maintaining GCP stability through incident support, developing incident management processes, building tooling for improved system visibility, and implementing proactive measures to reduce major incidents. You'll work closely with Cloud Support leadership and contribute to system design, capacity planning, and continuous improvement initiatives.

This is an ideal role for someone who combines strong technical skills with leadership ability, has a passion for system reliability, and wants to make a significant impact on Google's cloud infrastructure. The position offers the chance to work on cutting-edge technology while ensuring millions of users have a reliable and efficient cloud experience.

Last updated 2 days ago

Responsibilities For Senior Software Engineer, SRE, Cloud Incident Response

  • Ensure Google Cloud Platform (GCP) stability and reliability through critical incident support
  • Create training, end-to-end processes for incident management life-cycle
  • Build systems and tooling to support Incident Response team
  • Define and escalate risks in Cloud, reduce Major incident probabilities
  • Ensure the scalability and reliability of systems throughout their life-cycle

Requirements For Senior Software Engineer, SRE, Cloud Incident Response

Linux
Kubernetes
  • Bachelor's degree in Computer Science, a related field, or equivalent practical experience
  • 5 years of experience with software development in one or more programming languages
  • 5 years of experience with data structures or algorithms
  • 3 years of experience in designing, analyzing, and troubleshooting distributed systems
  • 2 years of experience leading projects and providing technical leadership
  • Experience in SRE or incident management/response environments

Benefits For Senior Software Engineer, SRE, Cloud Incident Response

Medical Insurance
Dental Insurance
Vision Insurance
401k
  • Comprehensive health benefits
  • Retirement benefits

Interested in this job?

Jobs Related To Google Senior Software Engineer, SRE, Cloud Incident Response

Senior Software Developer, Site Reliability Development, Google Cloud

Senior Software Developer role focusing on Site Reliability Engineering for Google Cloud, building and maintaining large-scale distributed systems with emphasis on reliability and automation.

Senior Software Engineer, Storage Components and Integrations SRE

Senior SRE position at Google focusing on storage components and integrations, requiring expertise in distributed systems and software development.

Senior Software Engineer, Site Reliability Engineering

Senior SRE position at Google Bengaluru focusing on Enterprise Applications, combining software and systems engineering to build and maintain large-scale distributed systems.

Senior Software Engineer, Site Reliability Engineering, Technical Infrastructure

Senior SRE position at Google focusing on building and maintaining large-scale distributed systems for Google Cloud's Technical Infrastructure.

Senior Software Engineer, Site Reliability Engineering, Google Cloud

Senior SRE position at Google Cloud focusing on building and maintaining large-scale distributed systems, requiring expertise in software development, system design, and technical leadership.