Site Reliability Engineer L4/L5 - Live Cloud Platform SRE

World's leading entertainment service with 283 million paid memberships in over 190 countries, offering TV series, films and games.
$100,000 - $720,000
Site Reliability
Senior Software Engineer
Remote
5,000+ Employees
5+ years of experience
Entertainment · Enterprise SaaS

Description For Site Reliability Engineer L4/L5 - Live Cloud Platform SRE

Netflix, the global entertainment leader serving 283 million subscribers worldwide, is seeking a Site Reliability Engineer for its Live Cloud Platform. This role is central to Netflix's expansion into live content streaming, including events like the SAG Awards and sports matches. As an SRE, you'll be responsible for ensuring seamless live streaming experiences for millions of viewers globally.

The position focuses on managing cloud traffic infrastructure, specifically working with API Gateway and inter-process communication between microservices. You'll be responsible for implementing robust solutions to handle sudden traffic spikes, particularly during live event launches. The role combines deep technical expertise in cloud infrastructure with practical problem-solving skills.

This is an opportunity for experienced engineers passionate about large-scale systems and live streaming technology. You'll work with Go, Python, and Rust, along with various big data processing tools. The position offers competitive compensation ($100,000-$720,000) with the flexibility to choose between salary and stock options.

Netflix offers an inclusive, innovative culture with comprehensive benefits including health coverage, mental health support, and flexible time off. The remote work option provides flexibility while maintaining connection with the team in Los Gatos, CA. If you're excited about solving complex technical challenges in live streaming at global scale, this role offers the perfect blend of technical depth and real-world impact.


Responsibilities For Site Reliability Engineer L4/L5 - Live Cloud Platform SRE

  • Drive continual improvement in observability, monitoring, and scalability for live streaming cloud traffic
  • Implement, automate, execute, and analyze results from live streaming delivery testing
  • Write and review code, develop documentation, and debug complex problems
  • Coordinate and collaborate with stakeholders for live-streaming events execution
  • Participate in an on-call rotation with flexible hours based on the live events schedule

Requirements For Site Reliability Engineer L4/L5 - Live Cloud Platform SRE

Go · Python · Rust · Linux · Kafka
  • 5+ years of service reliability/operational experience with large-scale systems
  • Knowledge of L4 load balancer, HTTP cache, and reverse proxy technologies
  • Expert-level knowledge of Unix or Linux systems and TCP/IP network fundamentals
  • Solid understanding of networking principles and protocols (DNS, TLS, HTTP/HTTPS)
  • Proficient in programming languages like Go, Python, Rust
  • Experience with real-time and big data analytics processing technologies
  • Strong collaboration and communication skills
  • Preferred: B.S. in Computer Science, Electrical Engineering, or Computer Engineering, or equivalent experience

Benefits For Site Reliability Engineer L4/L5 - Live Cloud Platform SRE

Medical Insurance · Mental Health Assistance · 401k · Vision Insurance · Dental Insurance · Parental Leave
  • Comprehensive Health Plans
  • Mental Health support
  • 401(k) Retirement Plan with employer match
  • Stock Option Program
  • Disability Programs
  • Health Savings and Flexible Spending Accounts
  • Family-forming benefits
  • Life and Serious Injury Benefits
  • Paid leave of absence programs
  • 35 days of paid time off annually (hourly employees)
  • Flexible time off (salaried employees)

