LinkedIn, the world's largest professional network, is seeking a Director of Site Health SRE to lead their critical infrastructure reliability efforts. This key leadership position offers an opportunity to shape the reliability and performance of a platform used by millions globally.
The role combines technical leadership with strategic oversight, requiring expertise in Site Reliability Engineering (SRE) practices and large-scale systems operations. You'll be responsible for ensuring LinkedIn's site reliability through incident management, capacity planning, and building robust self-serve platforms. The position offers a competitive compensation package ranging from $203,000 to $333,000, plus additional benefits including stock options and performance bonuses.
As the Director, you'll lead a distributed team of SREs and Site Operations professionals, driving initiatives to enhance site reliability, implement observability solutions, and foster a reliability-first culture across the engineering organization. The role requires a strong technical background with at least 10 years of engineering leadership experience and deep knowledge of reliability engineering practices.
Key responsibilities include developing incident management platforms, leading post-incident reviews, implementing reliability metrics and dashboards, and ensuring adequate capacity planning for traffic spikes. You'll work closely with product and platform teams to maintain service level objectives (SLOs) and drive continuous improvement in site reliability.
The position is hybrid, based in Sunnyvale, CA, offering flexibility while maintaining strong team collaboration. LinkedIn provides a culture built on trust, care, inclusion, and professional growth opportunities. This role is perfect for an experienced engineering leader passionate about large-scale systems reliability and team development.
Requirements include a CS degree, 10+ years of engineering leadership, and extensive experience with reliability engineering and large-scale infrastructure. The ideal candidate will have strong communication skills, experience with programming languages like Python, Go, Java, or Ruby, and a proven track record of building and leading high-performing teams.