Systems Engineer III, Host Networking Site Reliability Engineering

A global technology company that specializes in internet-related services and products.
Site Reliability
Mid-Level Software Engineer
In-Person
5,000+ Employees
2+ years of experience
Enterprise SaaS · Cloud

Description For Systems Engineer III, Host Networking Site Reliability Engineering

Google's Site Reliability Engineering (SRE) team is seeking a Systems Engineer III to join its Host Networking division. This role combines software and systems engineering to build and maintain Google Cloud's large-scale distributed systems. As an SRE, you'll be responsible for ensuring the reliability and uptime of both internal and external systems while managing complex challenges unique to Google's scale. The position requires expertise in Linux systems, networking, and programming, with a focus on optimization and automation. You'll work in a culture that values intellectual curiosity and problem-solving, collaborating with diverse teams to design, develop, and maintain critical infrastructure. The role offers opportunities to work on meaningful projects while receiving support and mentorship for professional growth. The position is located in Dublin, Ireland, and suits engineers passionate about infrastructure, system reliability, and large-scale distributed systems.

Responsibilities For Systems Engineer III, Host Networking Site Reliability Engineering

  • Improve the whole life-cycle of services from inception and design, through deployment, operation, and refinement
  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews
  • Provide guidance to other team members on managing the availability and performance of mission-critical services
  • Maintain services once they are live by measuring and monitoring availability, latency, and overall system health
  • Scale systems sustainably through mechanisms like automation and evolve systems by driving changes that improve reliability and velocity

Requirements For Systems Engineer III, Host Networking Site Reliability Engineering

Linux
  • Bachelor's degree in Computer Science, a related field, or equivalent practical experience
  • 2 years of experience with programming in one or more programming languages
  • 2 years of experience working with Unix/Linux systems internals and administration or networking
  • Experience working in computing, distributed systems, storage, or networking
  • Expertise in designing, analyzing, and troubleshooting large-scale distributed systems
  • Ability to debug and optimize code and to automate routine tasks
  • Excellent problem-solving approach, with effective verbal and written communication skills

Jobs Related To Google Systems Engineer III, Host Networking Site Reliability Engineering

Software Engineer II, Site Reliability Engineering, Pub/Sub

Site Reliability Engineer role at Google focusing on maintaining and optimizing Google Cloud's Pub/Sub services with emphasis on reliability and scalability.

Software Engineer II, Site Reliability Engineering, Google Cloud

Software Engineer II position in Google Cloud's Site Reliability Engineering team, focusing on building and maintaining large-scale distributed systems with emphasis on reliability and performance optimization.

Software Engineer III, Google Cloud, Site Reliability Engineering

Software Engineer III position at Google Cloud focusing on Site Reliability Engineering, building and maintaining large-scale distributed systems, with a remote work opportunity in Poland.

Software Engineer III, Site Reliability Engineering, Network Management

Site Reliability Engineer position at Google focusing on network management and distributed systems, requiring 2+ years of software development experience.

Software Engineer III, Site Reliability Engineering, Google Cloud

Site Reliability Engineer III position at Google Cloud, focusing on building and maintaining large-scale distributed systems with competitive compensation and benefits.