Systems Engineer III, Site Reliability Engineering, Cloud Databases

Google is a global technology leader that develops innovative products and services used by billions of people.
Site Reliability
Mid-Level Software Engineer
In-Person
2+ years of experience
Enterprise SaaS

Description For Systems Engineer III, Site Reliability Engineering, Cloud Databases

Google's Site Reliability Engineering (SRE) team combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. As a Systems Engineer III in the Cloud Databases team, you'll ensure Google Cloud's services maintain reliability and appropriate uptime while monitoring system capacity and performance. The role involves optimizing existing systems, building infrastructure, and automating processes. You'll work in a diverse, collaborative environment that values intellectual curiosity and problem-solving. The Technical Infrastructure team is responsible for the architecture that powers Google's product portfolio, from developing and maintaining data centers to building next-generation platforms. The role offers opportunities to work on unique scaling challenges while using expertise in coding, algorithms, complexity analysis, and large-scale system design. You'll be part of a team that promotes self-direction, mentorship, and growth while managing critical services and driving system improvements.

Last updated 13 days ago

Responsibilities For Systems Engineer III, Site Reliability Engineering, Cloud Databases

  • Improve the whole life-cycle of services from inception and design, through deployment, operation, and refinement
  • Manage support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews
  • Provide guidance to other team members on managing availability and performance of mission critical services
  • Maintain services once they are live by measuring and monitoring availability, latency, and overall system health
  • Lead incident response and blameless postmortems
  • Scale systems sustainably through mechanisms like automation and evolve systems by driving changes that improve reliability and velocity

Requirements For Systems Engineer III, Site Reliability Engineering, Cloud Databases

Linux
  • Bachelor's degree in Computer Science, a related field, or equivalent practical experience
  • 2 years of experience with programming in one or more programming languages
  • 2 years of experience working with Unix/Linux systems internals and administration or networking
  • Experience working in computing, distributed systems, storage, or networking
  • Experience in designing, analyzing, and troubleshooting distributed systems
  • Ability to debug, optimize code, and to automate routine tasks
  • Excellent problem-solving and communication skills

Interested in this job?

Jobs Related To Google Systems Engineer III, Site Reliability Engineering, Cloud Databases

Software Developer III, Site Reliability Development, Google Cloud

Site Reliability Developer role at Google Cloud focusing on building and maintaining large-scale distributed systems with emphasis on reliability and scalability.

Site Reliability Engineering, Transformative Compute Site Reliability Engineering

Site Reliability Engineer position at Google focusing on building and maintaining large-scale distributed systems for Google Cloud services.

Technical Program Manager III, Networking Site Reliability Engineering, Cloud Edge

Technical Program Manager III position at Google, focusing on Networking Site Reliability Engineering for Cloud Edge infrastructure, offering competitive compensation and benefits.

Systems Engineer III, Site Reliability Engineering

Site Reliability Engineer position at Google focusing on building and maintaining large-scale distributed systems with emphasis on automation and reliability.

Databases Site Reliability Engineer

Site Reliability Engineer position at Google focusing on database systems, requiring expertise in programming, Linux systems, and distributed computing.