Software Engineer III, Site Reliability Engineering, Google Cloud

Google is a global technology company that builds and runs large-scale, distributed systems and services.
Site Reliability
Mid-Level Software Engineer
In-Person
5,000+ Employees
2+ years of experience
Enterprise SaaS · Cloud

Description For Software Engineer III, Site Reliability Engineering, Google Cloud

Site Reliability Engineering (SRE) at Google Cloud combines software and systems engineering to build and maintain large-scale, distributed systems. As an SRE, you'll be responsible for ensuring the reliability and uptime of Google Cloud's services, both internal and customer-facing systems. The role involves complex challenges of scale unique to Google Cloud, requiring expertise in coding, algorithms, complexity analysis, and large-scale system design.

The position offers opportunities to work on meaningful projects in a blame-free environment that values diversity, intellectual curiosity, and problem-solving. You'll be part of a team that promotes self-direction while providing support and mentorship for professional growth. The role involves managing project priorities, deadlines, and deliverables, as well as designing, developing, testing, deploying, maintaining, and enhancing software solutions.

SRE's focus includes optimizing existing systems, building infrastructure, and automating processes to eliminate manual work. You'll be responsible for monitoring system capacity and performance, ensuring services meet customer needs, and maintaining a fast rate of improvement. The role combines technical expertise with collaborative teamwork in a diverse environment that brings together people with various backgrounds and perspectives.

As an SRE at Google Cloud, you'll contribute to a culture that values openness and continuous learning. The position offers unique challenges in managing large-scale distributed systems while working with cutting-edge technology. This role is perfect for someone who enjoys both software development and systems engineering, with a keen interest in reliability and scalability challenges.

Last updated 7 days ago

Responsibilities For Software Engineer III, Site Reliability Engineering, Google Cloud

  • Write product or system development code
  • Review code developed by other engineers and provide feedback to ensure best practices
  • Contribute to existing documentation or educational content
  • Triage product or system issues and debug/track/resolve by analyzing the sources of issues
  • Participate in, or lead design reviews with peers and stakeholders

Requirements For Software Engineer III, Site Reliability Engineering, Google Cloud

Python
Go
Java
Kubernetes
Linux
  • Bachelor's degree in Computer Science, a related field, or equivalent practical experience
  • 2 years of experience with data structures/algorithms and software development
  • Experience working in computing, distributed systems, storage, or networking
  • Expertise in designing, analyzing, and troubleshooting large-scale distributed systems
  • Ability to debug, optimize code, and to automate routine tasks
  • Systematic problem-solving approach
  • Effective verbal and written communication skills
  • English proficiency

Benefits For Software Engineer III, Site Reliability Engineering, Google Cloud

Medical Insurance
Parental Leave
Equity
  • Equal employment opportunity
  • Accommodation for special needs
  • Global work environment

Interested in this job?

Jobs Related To Google Software Engineer III, Site Reliability Engineering, Google Cloud

Software Developer III, Site Reliability Development, Google Cloud

Site Reliability Developer role at Google Cloud, focusing on building and maintaining large-scale distributed systems with emphasis on reliability and performance.

Software Developer II, Site Reliability Development, Google Cloud

Site Reliability Development Engineer position at Google Cloud, focusing on building and maintaining large-scale distributed systems with emphasis on reliability and performance.

Site Reliability Engineering, Transformative Compute Site Reliability Engineering

Site Reliability Engineer position at Google focusing on building and maintaining large-scale distributed systems for Google Cloud services.

Databases Site Reliability Engineer

Site Reliability Engineer position at Google focusing on database systems, requiring expertise in distributed systems, programming, and Linux/Unix administration.

Systems Engineer III, Site Reliability Engineering

Systems Engineer III position at Google focusing on Site Reliability Engineering, building and maintaining large-scale distributed systems with 2+ years of experience required.