Taro Logo

Lead Site Reliability Engineer (Technical Duty Officer)

Xero helps supercharge businesses by automating routine tasks, surfacing actionable insights and connecting businesses with data, advisors and apps.
Melbourne VIC, AustraliaSydney NSW, AustraliaBrisbane QLD, Australia
Site Reliability
Staff Software Engineer
Hybrid
8+ years of experience
Enterprise SaaS

Job Description

Xero, a leading business automation and insights platform, is seeking a Lead Site Reliability Engineer to join their Incident and Problem Management team. This crucial role sits within the Site Reliability Engineering (SRE) organization and will be instrumental in building and maintaining robust incident management processes. The position combines technical leadership with incident response expertise, requiring an experienced SRE professional who can drive best practices across the business.

The role offers an exciting opportunity to shape Xero's SRE culture and lead technical responses to high-severity cloud issues. As a Technical Duty Officer (TDO), you'll serve as an incident commander, utilizing SRE skillsets to drive quick resolution of critical events. The position involves working with cutting-edge cloud technologies, particularly AWS, and requires strong coding abilities with a preference for Python.

The ideal candidate will bring a combination of technical expertise and leadership capabilities, with experience in both hands-on engineering and incident management. You'll be responsible for developing scalable processes, implementing observability strategies, and fostering a culture of continuous learning and technical excellence.

Xero offers an attractive benefits package including generous paid leave, comprehensive health coverage, and an Employee Share Plan. The company's commitment to work-life balance is evident through their flexible working arrangements and wellbeing programs. With locations across major Australian cities and a hybrid work model, this role provides the opportunity to work with a leading tech company while maintaining flexibility.

This is an excellent opportunity for an experienced SRE professional looking to make a significant impact in a company that's transforming how businesses operate. You'll be at the forefront of maintaining and improving the reliability of systems that thousands of businesses depend on daily.

Last updated a day ago

Responsibilities For Lead Site Reliability Engineer (Technical Duty Officer)

  • Own the incident management process for all products and services
  • Provide expert leadership during critical outages
  • Lead and advocate for SRE transformation within the organization
  • Develop and implement scalable process frameworks and observability strategies
  • Collaborate with product teams to analyze failures and improve service reliability
  • Promote customer-focused approach in addressing global environment issues

Requirements For Lead Site Reliability Engineer (Technical Duty Officer)

Python
Linux
  • Previous experience as a Site Reliability Engineer in Operations or Engineering
  • Strong hands-on coding experience (preferably Python)
  • Hands-on experience troubleshooting AWS hosted services
  • Networking knowledge (TCP/IP, SSL/TLS, DNSSEC, IPsec, and BGP)
  • Strong communication skills

Benefits For Lead Site Reliability Engineer (Technical Duty Officer)

Medical Insurance
Vision Insurance
Dental Insurance
Mental Health Assistance
Parental Leave
Equity
  • Generous paid leave
  • Health insurance
  • Life insurance
  • Income protection
  • Wellbeing and sports programmes
  • 26 weeks paid parental leave for primary caregivers
  • Employee Share Plan
  • Flexible working
  • Career development
  • Employee Assistance Program
  • Mental health care access for employees and family