Taro Logo

Software Engineer, Site Reliability Engineering, Cloud API Infrastructure

Google is a global technology company that builds innovative products and services used by billions of users.
Site Reliability
Mid-Level Software Engineer
In-Person
5,000+ Employees
2+ years of experience
Enterprise SaaS · Cloud

Description For Software Engineer, Site Reliability Engineering, Cloud API Infrastructure

Site Reliability Engineering (SRE) at Google Cloud combines software and systems engineering to build and maintain large-scale distributed systems. This role focuses on ensuring reliability and uptime for Google Cloud's services while managing complex challenges of scale. As an SRE, you'll work on optimizing existing systems, building infrastructure, and automating processes. The position requires expertise in coding, algorithms, complexity analysis, and large-scale system design. Google's SRE team values intellectual curiosity, problem-solving, and openness, bringing together diverse perspectives in a blame-free environment. The role involves managing project priorities, developing software solutions, and maintaining critical infrastructure. You'll be part of a team that promotes self-direction while providing support and mentorship for professional growth. The position offers the opportunity to work on meaningful projects that directly impact Google Cloud's infrastructure and services. This role is ideal for engineers passionate about system reliability, automation, and large-scale distributed systems who want to work at the forefront of cloud technology.

Last updated 7 days ago

Responsibilities For Software Engineer, Site Reliability Engineering, Cloud API Infrastructure

  • Write product or system development code
  • Review code developed by other engineers and provide feedback to ensure best practices
  • Contribute to existing documentation or educational content
  • Triage product or system issues and debug, track, resolve by analyzing the sources of issues
  • Participate in, or lead design reviews with peers and stakeholders

Requirements For Software Engineer, Site Reliability Engineering, Cloud API Infrastructure

Linux
  • Bachelor's degree in Computer Science, a related field, or equivalent practical experience
  • 2 years of experience with software development in one or more programming languages
  • 2 years of experience with data structures or algorithms
  • Experience in designing, analyzing, and troubleshooting large-scale distributed systems
  • Experience working in computing, distributed systems, storage, or networking
  • Ability to debug, optimize code, and to automate routine tasks
  • Excellent communication and problem-solving skills

Benefits For Software Engineer, Site Reliability Engineering, Cloud API Infrastructure

Medical Insurance
Parental Leave
  • Equal employment opportunity
  • Inclusive work environment

Interested in this job?

Jobs Related To Google Software Engineer, Site Reliability Engineering, Cloud API Infrastructure

Software Developer III, Site Reliability Development, Google Cloud

Site Reliability Developer position at Google Cloud focusing on building and maintaining large-scale distributed systems, requiring 2+ years of software development experience.

Software Developer II, Site Reliability Development, Google Cloud

Site Reliability Developer position at Google Cloud focusing on building and maintaining large-scale distributed systems with emphasis on reliability and performance optimization.

Software Developer II, Site Reliability Development, Google Cloud

Software Developer II position focused on Site Reliability Development for Google Cloud, building and maintaining large-scale distributed systems.

Site Reliability Engineer, F1 SRE

Site Reliability Engineer position at Google focusing on maintaining and improving large-scale distributed systems for Google Cloud services.

Site Reliability Engineer, Video Processing SRE

Site Reliability Engineer position at Google focusing on video processing systems, combining operations and software engineering to ensure reliability and performance at scale.