
Software Engineering Manager, Site Reliability Engineering, Platform, Devices

Google is a global technology company that builds and maintains large-scale, distributed systems and infrastructure.
$150,000 - $300,000
Site Reliability
Staff Software Engineer
In-Person
5,000+ Employees
8+ years of experience
Enterprise SaaS
This job posting may no longer be active.

Description For Software Engineering Manager, Site Reliability Engineering, Platform, Devices

Site Reliability Engineering (SRE) at Google combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. As a Software Engineering Manager on the SRE team, you'll lead a team responsible for ensuring Google's services maintain reliability and appropriate uptime while continuously improving performance and capacity. The role involves tackling complex challenges unique to Google's scale, drawing on expertise in coding, algorithms, and large-scale system design.

The position is within Google's Technical Infrastructure team, which is fundamental to keeping Google's extensive product portfolio running smoothly. You'll be responsible for leading a team that maintains and develops data centers and next-generation Google platforms, ensuring optimal network performance and user experience.

The role requires strong technical leadership skills, with a focus on mentoring and growing engineering teams in a fast-paced environment. You'll oversee critical services' availability and performance, implement automation strategies, and manage global on-call rotations. The position combines technical expertise with people management, requiring both deep systems knowledge and the ability to inspire and develop team members.

SRE at Google promotes a culture of diversity, intellectual curiosity, and problem-solving in a blame-free environment. The team brings together individuals with varied backgrounds and perspectives, encouraging collaboration and innovative thinking. This role offers the opportunity to work on meaningful projects with significant impact while receiving support and mentorship for continuous learning and growth.

Last updated a month ago

Responsibilities For Software Engineering Manager, Site Reliability Engineering, Platform, Devices

  • Lead a team of software and systems engineers on projects for users, take responsibility for uptime, mentor the team, and establish credibility through high-quality technical execution
  • Own availability and performance of key services and build automation to prevent problem recurrence
  • Develop and grow engineering teams through mentoring, coaching, succession planning, and retention strategies
  • Manage on-call rotations across continents, using a follow-the-sun model
  • Design, write, and deliver software to improve the availability, scalability, latency, and efficiency of Google's services

Requirements For Software Engineering Manager, Site Reliability Engineering, Platform, Devices

Linux
  • Bachelor's degree in Computer Science, a related field, or equivalent practical experience
  • 8 years of experience with data structures or algorithms
  • 5 years of experience with software development in one or more programming languages
  • 3 years of experience managing people or teams, leading projects, and designing, analyzing, and troubleshooting distributed systems
  • Experience with algorithms, data structures and analysis, software design, Unix/Linux systems, IP networking, performance, and application issues
  • Ability to set and drive strategy while providing technical guidance to the team
  • Ability to inspire and motivate the engineering team
  • Ability to research code, networking, operating systems, and storage

Benefits For Software Engineering Manager, Site Reliability Engineering, Platform, Devices

Medical Insurance
Dental Insurance
Vision Insurance
Parental Leave
Visa Sponsorship
  • Equal opportunity employer
  • Accommodations for applicants with special needs
  • Inclusive work environment
