Taro Logo

Site Reliability Engineer (L4/L5)

Netflix is one of the world's leading entertainment services, with 283 million paid memberships in over 190 countries.
Site Reliability
Senior Software Engineer
In-Person
5,000+ Employees
3+ years of experience
Enterprise SaaS · Entertainment
This job posting may no longer be active. You may be interested in these related jobs instead:
Site Reliability Developer (Join OCI Ns2)

Senior Site Reliability Developer position at Oracle focusing on building and maintaining large-scale distributed systems with emphasis on security, resiliency, and performance.

Site Reliability Engineer

Senior Site Reliability Engineer position at Wheely, focusing on infrastructure security, monitoring, and DevOps practices in Nicosia, Cyprus.

Senior Software Engineer, Site Reliability Engineering

Senior SRE position at Adobe working on Identity Services, focusing on scalability, reliability and zero downtime for systems handling millions of requests.

Site Reliability Engineer

Senior Site Reliability Engineer position at Bounteous in Montreal, focusing on system reliability, ServiceNow administration, and operational excellence in a hybrid work environment.

Senior Site Reliability Engineer

Senior SRE role at Oracle focusing on designing and managing scalable infrastructure for enterprise applications using OCI across multiple regions.

Description For Site Reliability Engineer (L4/L5)

Netflix, a global entertainment leader serving 283 million subscribers across 190+ countries, is seeking a Site Reliability Engineer (L4/L5) to join their N-Tech SRE team in Warsaw. This role focuses on enhancing the reliability and resilience of Netflix's internal services used by employees worldwide. As an SRE, you'll work on implementing best practices, automation, and proactive measures to maintain system reliability. The position requires expertise in distributed systems, strong programming skills in languages like Python, Go, or Java, and experience with cloud platforms and infrastructure as code. You'll participate in on-call rotations, incident response, and collaborate with cross-functional teams to improve service availability. The role offers the opportunity to work with cutting-edge technology at scale, implement robust monitoring systems, and drive improvements in system reliability. Netflix values diversity and inclusion, offering a collaborative environment where you can leverage your unique experiences to solve complex technical challenges. The position is ideal for candidates passionate about reliability engineering and interested in working with complex distributed systems at one of the world's leading streaming platforms.

Last updated 9 hours ago

Responsibilities For Site Reliability Engineer (L4/L5)

  • Design, implement, and maintain scalable and reliable infrastructure
  • Collaborate with engineering and product teams on observability, reliability, and security
  • Develop and implement automation tools for monitoring, deployment, and incident response
  • Conduct capacity planning, performance analysis, and system tuning
  • Participate in on-call rotations and incident response
  • Implement and improve monitoring and alerting systems
  • Implement and maintain disaster recovery plans
  • Evaluate and recommend improvements for system observability
  • Identify sources of instability in distributed systems
  • Engage with product teams to diagnose operational issues
  • Implement incident response framework
  • Champion continuous learning culture

Requirements For Site Reliability Engineer (L4/L5)

Python
Go
Java
JavaScript
Node.js
Kubernetes
  • 3+ years of experience as a Site Reliability Engineer or similar role
  • Strong scripting and programming skills (Python, Go, Java or JavaScript/Node.js)
  • Experience with complex sociotechnical systems
  • Experience with incident management and response
  • Experience with Infrastructure as code like Terraform and Kubernetes
  • Experience with cloud platforms like AWS, microservices architecture
  • Excellent communication & collaboration skills
  • Proven ability to cultivate relationships through influence
  • Proven ability to troubleshoot complex issues
  • Familiarity with Human Factors Engineering
  • Ability to grow expertise, influence & educate others

Interested in this job?