Taro Logo

Site Reliability Engineer (L4/5) - CORE

Netflix is one of the world's leading entertainment services with over 300 million paid memberships in 190+ countries offering TV series, films and games.
$100,000 - $720,000
Site Reliability
Senior Software Engineer
Remote
5,000+ Employees
5+ years of experience
Enterprise SaaS · Entertainment

Job Description

Netflix, a global entertainment leader with 300M+ subscribers across 190+ countries, is seeking a Site Reliability Engineer for their Critical Operations and Reliability Engineering team. This role is crucial in driving customer satisfaction by managing risk and minimizing impact across Netflix's streaming platform.

The position offers an exciting opportunity to work with large-scale distributed systems and contribute to Netflix's world-class streaming infrastructure. As an SRE, you'll be responsible for designing and implementing scalable infrastructure, developing automation tools, and participating in incident response. The role requires a blend of coding expertise, system architecture knowledge, and strong problem-solving abilities.

The ideal candidate brings 5+ years of SRE experience and proficiency in languages like Python, Go, or Java. You should be well-versed in cloud infrastructure (AWS/Azure/GCP), Infrastructure as Code, and container orchestration systems like Kubernetes. Strong communication skills and the ability to influence across teams are essential, as you'll be collaborating with various engineering groups to promote reliability practices.

Netflix offers a unique culture that values innovation, excellence, and inclusion. The compensation package is highly competitive, ranging from $100,000 to $720,000, with the flexibility to choose between salary and stock options. The company provides comprehensive benefits including health coverage, mental health support, 401(k) with employer match, and generous time-off policies.

Working at Netflix means joining a team that's passionate about entertainment and technology, with the opportunity to impact millions of users worldwide. The role offers remote work flexibility while maintaining a collaborative environment focused on solving complex technical challenges at scale.

Last updated 5 days ago

Responsibilities For Site Reliability Engineer (L4/5) - CORE

  • Design, implement, and maintain scalable and reliable infrastructure to support Netflix Streaming Suite
  • Collaborate with engineering and product teams to integrate observability, reliability, and security considerations
  • Develop and implement automation tools for monitoring, deployment, and incident response
  • Participate in on-call rotations to ensure 24/7 health of Netflix Streaming
  • Implement and maintain robust incident response framework
  • Proactively identify sources of instability in distributed systems
  • Champion and embed a culture of reliability across the Ads organization

Requirements For Site Reliability Engineer (L4/5) - CORE

Python
Go
Java
Kubernetes
  • 5+ years of experience as a Site Reliability Engineer, Production Engineer, or similar role
  • Proficiency in Python, Go, or Java
  • Hands-on experience with cloud providers (AWS/Azure/GCP)
  • Experience with Infrastructure as Code such as Terraform
  • Understanding of large-scale distributed systems
  • Excellent communication and collaboration skills
  • Experience with incident management and response
  • Natural troubleshooting abilities
  • Growth mindset and continuous improvement attitude

Benefits For Site Reliability Engineer (L4/5) - CORE

Medical Insurance
Mental Health Assistance
401k
Vision Insurance
Dental Insurance
Parental Leave
  • Health Plans
  • Mental Health support
  • 401(k) Retirement Plan with employer match
  • Stock Option Program
  • Disability Programs
  • Health Savings and Flexible Spending Accounts
  • Family-forming benefits
  • Life and Serious Injury Benefits
  • 35 days annually for paid time off (hourly employees)
  • Flexible time off (salaried employees)

Related Jobs