Site Reliability Engineer

A leading global technology company that specializes in internet-related services and products.
Site Reliability
Mid-Level Software Engineer
In-Person
5,000+ Employees
2+ years of experience
Enterprise SaaS

Description For Site Reliability Engineer

Google is seeking a Site Reliability Engineer (SRE) to join their team in Dublin, Ireland. This role combines software and systems engineering to build and maintain Google Cloud's large-scale, distributed systems. As an SRE, you'll be responsible for ensuring the reliability and uptime of both internal and external systems while managing the challenges of scale unique to Google Cloud.

The position requires expertise in coding, algorithms, complexity analysis, and large-scale system design. You'll work on optimizing existing systems, building infrastructure, and automating processes to eliminate manual work. The role involves managing project priorities, deadlines, and deliverables, as well as designing, developing, testing, deploying, and maintaining software solutions.

SRE's culture emphasizes diversity, intellectual curiosity, and problem-solving in a blame-free environment. The team brings together people with diverse backgrounds and perspectives, encouraging collaboration and innovation. You'll have the opportunity to work on meaningful projects while receiving support and mentorship for professional growth.

Key responsibilities include contributing to projects like Automated Troubleshooting and Service Level Objectives, identifying needs across network telemetry services, and proposing solutions. You'll also be responsible for improving systems infrastructure and engaging with partner teams to ensure system reliability.

This is an excellent opportunity for someone with strong technical skills who wants to work at the intersection of software development and systems engineering, making a direct impact on Google's infrastructure reliability.

Last updated 5 hours ago

Responsibilities For Site Reliability Engineer

  • Contribute to land projects like Automated Troubleshooting, Better Monitoring and Service Level Objective (SLOs), Podification of services, etc.
  • Identify needs across network telemetry services. Propose, build and launch cross-service solutions to satisfy those needs
  • Motivate improvements in the team's systems, infrastructure around them, and network telemetry ecosystem
  • Engage with partner teams, users to make systems reliable with relatable SLOs. Guide technical plans and goals towards creating reliable systems
  • Operate the network telemetry systems of Google production network

Requirements For Site Reliability Engineer

Linux
  • Bachelor's degree in Computer Science, a related field, or equivalent practical experience
  • 2 years of experience with data structures/algorithms and software development in one or more programming languages
  • Experience in software engineering with knowledge of Google production network
  • Experience with research, propose and launching engineering solutions
  • Ability to collaborate with current and prospective partner teams, product and users to discover their needs and provide solutions
  • Excellent collaboration skills with technical goals for the team and partners
  • Excellent leadership skills

Benefits For Site Reliability Engineer

Medical Insurance
Visa Sponsorship
  • Comprehensive medical insurance
  • Visa sponsorship available

Interested in this job?

Jobs Related To Google Site Reliability Engineer

Software Developer III, Site Reliability Development, Google Cloud

Site Reliability Development Engineer position at Google Cloud, focusing on building and maintaining large-scale distributed systems with competitive compensation and benefits.

Software Developer II, Site Reliability Developer, Google Cloud

Site Reliability Developer position at Google Cloud focusing on building and maintaining large-scale distributed systems with emphasis on reliability, automation, and system optimization.

Site Reliability Engineer, F1 SRE

Site Reliability Engineer position at Google focusing on maintaining and improving large-scale distributed systems for Google Cloud services.

Software Engineer III, Shopping Build Site Reliability Engineer

Software Engineer III position at Google focusing on Site Reliability Engineering for Shopping Build systems, requiring 2+ years of experience in distributed systems and software development.

Site Reliability Engineer, Infrastructure, Play Games

Site Reliability Engineer position at Google focusing on infrastructure and reliability for Play Games services, combining software engineering with systems operations.