Taro Logo

Senior Software Developer, Site Reliability Development, Google Cloud

Google is a global technology company that builds innovative products and services used by billions of users.
Site Reliability
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
Enterprise SaaS · Cloud

Description For Senior Software Developer, Site Reliability Development, Google Cloud

Google Cloud is seeking a Senior Software Developer for Site Reliability Development to join their team in Waterloo. This role combines software and systems development to build and run large-scale, massively distributed, fault-tolerant systems. As an SRE Developer, you'll ensure Google's services maintain reliability and appropriate uptime while focusing on system optimization, infrastructure development, and automation. You'll tackle unique scaling challenges specific to Google's infrastructure while applying expertise in coding, algorithms, and large-scale system design.

The role involves managing the complete lifecycle of services from design to deployment and refinement. You'll work on system design consulting, develop software platforms and frameworks, conduct capacity planning, and perform launch reviews. Post-deployment responsibilities include monitoring system health, implementing automation for scale, and handling incident response.

The Technical Infrastructure team builds and maintains the architecture supporting Google's entire product portfolio. From developing data centers to creating next-generation platforms, the team ensures optimal performance and reliability of Google's vast network infrastructure.

This position offers the opportunity to work with complex distributed systems at massive scale, collaborate with talented engineers, and contribute to Google's critical infrastructure. The team promotes intellectual curiosity, problem-solving, and openness while encouraging self-direction on meaningful projects. You'll have access to mentorship and support for continued learning and growth.

The ideal candidate combines strong software development skills with systems engineering expertise and brings experience leading technical projects. If you're passionate about reliability, scalability, and automation, and want to work on infrastructure powering billions of users, this role offers an exciting opportunity to make a significant impact.

Last updated 2 days ago

Responsibilities For Senior Software Developer, Site Reliability Development, Google Cloud

  • Engage in and improve the whole lifecycle of services—from inception and design, through to deployment, operation and refinement
  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews
  • Maintain services once they are live by measuring and monitoring availability, latency and overall system health
  • Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity
  • Practice sustainable incident response and blameless postmortems

Requirements For Senior Software Developer, Site Reliability Development, Google Cloud

Linux
Kubernetes
  • Bachelor's degree in Computer Science, a related field, or equivalent practical experience
  • 5 years of experience with software development in one or more programming languages
  • 5 years of experience with data structures or algorithms
  • 3 years of experience in designing, analyzing, and troubleshooting large-scale distributed systems
  • 2 years of experience leading projects and providing technical leadership

Benefits For Senior Software Developer, Site Reliability Development, Google Cloud

Medical Insurance
401k
Parental Leave
  • Comprehensive health benefits
  • Retirement plans
  • Parental leave

Interested in this job?

Jobs Related To Google Senior Software Developer, Site Reliability Development, Google Cloud

Senior Software Engineer, SRE, Cloud Incident Response

Senior SRE position at Google focusing on Cloud Incident Response, requiring expertise in distributed systems and incident management.

Senior Software Engineer, Storage Components and Integrations SRE

Senior SRE position at Google focusing on storage components and integrations, requiring expertise in distributed systems and software development.

Senior Software Engineer, Site Reliability Engineering

Senior SRE position at Google Bengaluru focusing on Enterprise Applications, combining software and systems engineering to build and maintain large-scale distributed systems.

Senior Software Engineer, Site Reliability Engineering, Technical Infrastructure

Senior SRE position at Google focusing on building and maintaining large-scale distributed systems for Google Cloud's Technical Infrastructure.

Senior Software Engineer, Site Reliability Engineering, Google Cloud

Senior SRE position at Google Cloud focusing on building and maintaining large-scale distributed systems, requiring expertise in software development, system design, and technical leadership.