Staff Site Reliability Engineer - Technical Duty Officer

Zscaler is the operator of the world's largest security cloud, accelerating digital transformation for enterprises to be more agile, efficient, resilient, and secure.
$136,500 - $195,000
Site Reliability
Staff Software Engineer
Remote
5,000+ Employees
5+ years of experience
Cybersecurity
This job posting may no longer be active. You may be interested in these related jobs instead:
Staff Site Reliability Engineer

Staff Site Reliability Engineer position at Fivetran, focusing on infrastructure reliability, monitoring, and system evolution with hybrid work in Denver.

Production Support Engineering LMTS

Senior SRE position at Salesforce focusing on cloud infrastructure reliability, requiring U.S. citizenship and extensive experience with AWS, Kubernetes, and monitoring tools.

Site Reliability Engineer

Microsoft Site Reliability Engineer position in Cloud+AI team, focusing on secure infrastructure and Azure services deployment, offering hybrid work and competitive compensation.

Site Reliability Developer 3

Site Reliability Developer role at Oracle focusing on cloud infrastructure, automation, and system reliability with emphasis on security and scalability.

Site Reliability Developer 3

Site Reliability Developer role at Oracle focusing on cloud infrastructure, automation, and system reliability with emphasis on security and scalability.

Description For Staff Site Reliability Engineer - Technical Duty Officer

Zscaler, a leading cloud security company, is seeking a Staff Site Reliability Engineer-Technical Duty Officer to join their Shared Platform Engineer team. This role involves leading the transformation to a world-leading SRE organization, providing expert leadership during critical outages, promoting customer-focused approaches, developing scalable process frameworks, and collaborating with product teams to improve service reliability.

Key responsibilities include:

  • Advocating for SRE principles within the Engineering Department
  • Coordinating multiple teams during critical outages for streamlined decision-making and quick resolution
  • Addressing and mitigating global customer environment issues
  • Fostering a culture of continuous learning and technical excellence
  • Implementing observability strategies for rapid problem diagnosis and response
  • Analyzing failures and integrating insights to improve service reliability, scalability, and operational efficiency

The ideal candidate will have:

  • 5+ years of experience as a Site Reliability Engineer
  • Hands-on experience troubleshooting Linux-based systems
  • Strong networking knowledge (TCP/IP, SSL/TLS, DNSSEC, IPsec, BGP)
  • Coding experience, preferably in Python
  • Bachelor's degree in Computer Science or related field

Preferred qualifications:

  • Experience supporting High/Moderate FedRAMP environments
  • Understanding of Observability practices and tools (Grafana, DataDog, Splunk, etc.)
  • Experience leading major incidents in large scale, high uptime environments

This role offers remote work options, with a preference for the Eastern Time Zone. Zscaler provides comprehensive benefits, including various health plans, time off, parental leave, retirement options, and education reimbursement.

Join Zscaler's Engineering team and contribute to building and innovating the world's largest cloud security platform, serving thousands of enterprise customers globally.

Last updated 8 months ago

Responsibilities For Staff Site Reliability Engineer - Technical Duty Officer

  • Lead and advocate for the transformation to a world-leading SRE organization
  • Provide expert leadership during critical outages
  • Promote a customer-focused approach
  • Develop and implement scalable process frameworks and observability strategies
  • Collaborate with product teams to analyze failures and improve service reliability

Requirements For Staff Site Reliability Engineer - Technical Duty Officer

Linux
Python
  • 5+ years of experience as a Site Reliability Engineer
  • Hands-on experience troubleshooting Linux-based systems
  • Networking knowledge (TCP/IP, SSL/TLS, DNSSEC, IPsec, and BGP)
  • Coding experience (preferably Python)
  • Bachelor's degree in Computer Science or related field

Benefits For Staff Site Reliability Engineer - Technical Duty Officer

Medical Insurance
Dental Insurance
Vision Insurance
401k
Parental Leave
Education Budget
  • Various health plans
  • Time off plans for vacation and sick time
  • Parental leave options
  • Retirement options
  • Education reimbursement
  • In-office perks

Interested in this job?