Site Reliability Engineer

Google

A leading global technology company that specializes in internet-related services and products.

Dublin, Ireland

Site Reliability

Mid-Level Software Engineer

In-Person

5,000+ Employees

2+ years of experience

Enterprise SaaS

Description For Site Reliability Engineer

Google's Site Reliability Engineering (SRE) team is at the forefront of maintaining and optimizing the company's massive distributed systems. As an SRE, you'll combine software and systems engineering expertise to ensure Google Cloud's services maintain exceptional reliability and performance. The role involves working with both internally critical and externally-visible systems, focusing on optimizing existing systems, building infrastructure, and implementing automation solutions.

The position offers unique challenges of scale specific to Google Cloud, requiring expertise in coding, algorithms, complexity analysis, and large-scale system design. You'll be part of a diverse and intellectually curious team that values problem-solving and openness. The role emphasizes self-direction while providing support and mentorship for professional growth.

Key responsibilities include managing project priorities, developing software solutions, and ensuring system reliability through automated troubleshooting and monitoring. You'll work with network telemetry services, propose and implement cross-service solutions, and collaborate with partner teams to establish and maintain service level objectives (SLOs).

This is an excellent opportunity for engineers passionate about large-scale systems, automation, and reliability. The role offers exposure to cutting-edge technology and the chance to impact millions of users while working with some of the industry's most complex distributed systems. Google's culture of innovation, combined with its commitment to work-life balance and professional development, makes this an ideal position for those looking to advance their careers in site reliability engineering.

Last updated 12 days ago

Responsibilities For Site Reliability Engineer

Contribute to land projects like Automated Troubleshooting, Better Monitoring and Service Level Objective (SLOs), Podification of services, etc.
Identify needs across network telemetry services. Propose, build and launch cross-service solutions to satisfy those needs
Motivate improvements in the team's systems, infrastructure around them, and network telemetry ecosystem
Engage with partner teams, users to make systems reliable with relatable SLOs. Guide technical plans and goals towards creating reliable systems. Operate the network telemetry systems of Google production network

Requirements For Site Reliability Engineer

Linux

Python

Java

Bachelor's degree in Computer Science, a related field, or equivalent practical experience
2 years of experience with data structures/algorithms and software development in one or more programming languages
Experience in software engineering with knowledge of Google production network
Experience with research, propose and launching engineering solutions
Ability to collaborate with current and prospective partner teams, product and users to discover their needs and provide solutions
Excellent collaboration skills with technical goals for the team and partners
Excellent leadership skills

Benefits For Site Reliability Engineer

Medical Insurance

401k

Parental Leave

Education Budget

Comprehensive health benefits
Retirement plans
Parental leave
Professional development opportunities

Google

A leading global technology company that specializes in internet-related services and products.

Dublin, Ireland

Site Reliability

Mid-Level Software Engineer

In-Person

5,000+ Employees

2+ years of experience

Enterprise SaaS

Interested in this job?

Jobs Related To Google Site Reliability Engineer

Software Developer III, Site Reliability Development, Google Cloud

Google

Site Reliability Development Engineer position at Google Cloud, focusing on building and maintaining large-scale distributed systems with competitive compensation and benefits.

Software Developer II, Site Reliability Developer, Google Cloud

Google

Site Reliability Developer position at Google Cloud focusing on building and maintaining large-scale distributed systems with emphasis on reliability, automation, and system optimization.

Site Reliability Engineer, F1 SRE

Google

Site Reliability Engineer position at Google focusing on maintaining and improving large-scale distributed systems for Google Cloud services.

Site Reliability Engineer

Google

Site Reliability Engineer position at Google Dublin, combining software and systems engineering to ensure reliability of Google Cloud services.

Software Engineer III, Shopping Build Site Reliability Engineer

Google

Software Engineer III position at Google focusing on Site Reliability Engineering for Shopping Build systems, requiring 2+ years of experience in distributed systems and software development.