
Staff Software Engineer, Site Reliability Engineering

Google is a global technology leader specializing in internet-related services and products.
Site Reliability
Staff Software Engineer
In-Person
5,000+ Employees
8+ years of experience
Enterprise SaaS
This job posting may no longer be active.

Description For Staff Software Engineer, Site Reliability Engineering

Google is seeking a Staff Software Engineer for its Site Reliability Engineering (SRE) team. SRE combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. The role involves ensuring that Google Cloud's services have reliability and uptime appropriate to customer needs while maintaining a fast rate of improvement.

Key responsibilities include managing the lifecycle of services, supporting pre-launch activities, scaling systems through automation, working on critical Google Cloud services, and solving operations problems using software engineering principles. The ideal candidate will have extensive experience with data structures, algorithms, software development, and leading projects involving distributed systems.

Google offers a culture of diversity, intellectual curiosity, and problem-solving. The Technical Infrastructure team, which includes SRE, is crucial in developing and maintaining data centers and building next-generation Google platforms. This role provides an opportunity to work on unique, large-scale challenges while collaborating with a diverse team in a supportive, mentorship-rich environment.

The position requires a bachelor's degree in Computer Science or a related field (or equivalent practical experience), along with significant experience in software development and distributed systems. Preferred qualifications include expertise in computing, distributed systems, storage, or networking, as well as strong problem-solving and communication skills.

Google is an equal opportunity employer committed to building a diverse and inclusive workforce. It offers accommodations for applicants who need them and requires English proficiency for effective global collaboration.

Last updated 8 months ago

Responsibilities For Staff Software Engineer, Site Reliability Engineering

  • Engage in and improve the whole lifecycle of services, from inception and design through deployment, operation and refinement
  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews
  • Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity
  • Work on the availability, scalability, efficiency and latency of some of Google Cloud's most critical services
  • Solve operations problems by using software engineering principles and best practices. Collaborate with the developer teams on design, architecture and processes

Requirements For Staff Software Engineer, Site Reliability Engineering

Languages: Java, Python, Go
  • Bachelor's degree in Computer Science, a related field, or equivalent practical experience
  • 8 years of experience with data structures or algorithms
  • 5 years of experience with software development in one or more programming languages
  • 3 years of experience leading projects and designing, analyzing, and troubleshooting distributed systems
