Director, Software Engineering, Site Reliability

LinkedIn is the world's largest professional network, built to create economic opportunity for every member of the global workforce.
Site Reliability
Principal Software Engineer
Hybrid
5,000+ Employees
15+ years of experience
Enterprise SaaS

Description For Director, Software Engineering, Site Reliability

LinkedIn is the world's largest professional network, built to create economic opportunity for every member of the global workforce. Our products help people make powerful connections, discover exciting opportunities, build necessary skills, and gain valuable insights every day. We're committed to providing transformational opportunities for our own employees by investing in their growth.

As Director of Site Reliability Engineering, you will lead a strategic team of 40+ engineers focused on ensuring the reliability of LinkedIn's critical infrastructure systems. The role involves overseeing streaming, batch processing, data lake and online serving document store ecosystems.

Key Focus Areas:

  • Setting strategic vision and technical direction for infrastructure reliability
  • Building high-performing SRE teams and driving innovation culture
  • Implementing API-first approaches for infrastructure lifecycle management
  • Developing automated remediation systems
  • Providing deep observability into infrastructure health
  • Leading cloud platform initiatives across AWS/Azure/GCP
  • Driving infrastructure-as-code practices
  • Managing large-scale Kubernetes deployments

This is a unique opportunity to shape the reliability of systems at massive scale while working with cutting-edge technologies. The role offers significant scope for technical leadership and organizational impact through:

  • Mentoring and growing engineering talent
  • Driving architectural decisions for next-gen infrastructure
  • Building partnerships across engineering teams
  • Leading transformation initiatives

The ideal candidate will bring deep technical expertise in distributed systems, proven leadership experience, and a passion for building reliable, scalable infrastructure. This role provides an excellent platform to drive innovation while working with world-class engineering teams in a culture focused on trust, care, inclusion and fun.

Last updated a day ago

Responsibilities For Director, Software Engineering, Site Reliability

  • Set strategic vision and technical direction for infrastructure reliability organization
  • Build and maintain high-performing engineering team for SREs
  • Drive APIs first approach for operations of infrastructure life cycle management
  • Provide observability and deep actionable insights into infrastructure health
  • Lead automated remediations to restore desired state using infrastructure as code
  • Manage and scale team of 40+ engineers
  • Drive culture of continuous improvement and innovation

Requirements For Director, Software Engineering, Site Reliability

Kubernetes
Python
Go
Java
Ruby
Kafka
  • BA/BS degree in Computer Science or related field
  • 15+ years in Engineering leadership focused on dev/ops based roles leading teams of engineers of size 40+
  • 8+ years of building software to simplifying operations and reducing toil in managing large scale infrastructure
  • 8+ years experience in cloud data infrastructure, Apache Kafka, Apache Flink, Apache Hadoop infrastructure
  • Proficiency in cloud computing platforms (AWS, Azure, GCP)
  • Experience with Linux-based systems and container orchestration (Kubernetes)
  • Background in hands-on development in programming/scripting languages (Python, Go, Java or Ruby)
  • Experience attracting, retaining, and developing top engineering talent
  • Excellent communication and interpersonal skills

Benefits For Director, Software Engineering, Site Reliability

Medical Insurance
  • Health and wellness programs
  • Time away policies

Interested in this job?

Jobs Related To LinkedIn Director, Software Engineering, Site Reliability

Director, Software Engineering, Site Reliability

Lead LinkedIn's Site Reliability Engineering team in Bangalore, driving infrastructure reliability and automation for the world's largest professional network.

Director, Software Engineering, Site Reliability

Lead a 40+ person Site Reliability Engineering team at LinkedIn Bengaluru, focusing on infrastructure reliability, automation, and system scalability.

Director, Software Engineering, Site Reliability

Lead LinkedIn's Site Reliability Engineering team in Bengaluru, directing 40+ engineers and driving infrastructure reliability for critical systems.

Director, Software Engineering, Site Reliability

Lead LinkedIn's Site Reliability Engineering team in Bengaluru, directing 40+ engineers to ensure reliability of critical infrastructure systems while driving innovation and operational excellence.

Director, Software Engineering, Site Reliability

Lead a 40+ SRE team at LinkedIn Bengaluru, driving infrastructure reliability and automation for world's largest professional network. 15+ years leadership experience required.