Taro Logo

Site Reliability Engineer III

Guidewire delivers software for Property and Casualty insurance companies, enabling them to protect customers during crises through core applications and cloud services.
Curitiba, State of Paraná, Brazil
Site Reliability
Senior Software Engineer
Hybrid
1,000 - 5,000 Employees
5+ years of experience
Enterprise SaaS · Finance

Job Description

Guidewire, a leading provider of software solutions for the Property and Casualty (P&C) insurance industry, is seeking a Site Reliability Engineer III to join their SRE-Application team. This role is crucial in ensuring the reliability, performance, and scalability of applications running on the Guidewire Cloud Platform.

The position offers a unique blend of software engineering and operations, focusing on maintaining and improving critical insurance industry applications that handle billions of dollars in business. As an SRE III, you'll be responsible for troubleshooting complex distributed systems, developing automated solutions, and ensuring the platform's reliability through monitoring and optimization.

The ideal candidate will bring strong software engineering expertise in languages like Python, Go, or Java, combined with deep knowledge of cloud infrastructure, particularly AWS and Kubernetes. You'll work with cutting-edge technology and tools like Terraform, Datadog, and various CI/CD pipelines while implementing SLIs, SLOs, and Error Budgets.

This role requires participation in on-call rotations, making it suitable for professionals committed to ensuring 24/7 service reliability. You'll join a company culture that values integrity, rationality, and collegiality, working alongside talented peers who are passionate about transforming the insurance industry through technology.

Guidewire offers competitive compensation, comprehensive benefits, and significant career development opportunities. With over 540+ insurers in 40 countries relying on their platform, this role provides an opportunity to make a real impact in an industry that helps protect and support people during critical moments.

Last updated 4 days ago

Responsibilities For Site Reliability Engineer III

  • Work with development teams to troubleshoot and resolve issues
  • Develop and maintain automated runbooks
  • Monitor and improve reliability and performance of applications
  • Optimize systems and reduce manual toil
  • Document incidents and develop prevention processes
  • Participate in on-call rotations
  • Foster culture of innovation and continuous improvement

Requirements For Site Reliability Engineer III

Python
Go
Java
Kubernetes
  • Experience as an SRE or similar role
  • Software engineering background with Python, Go, or Java
  • Experience with SLIs, SLOs, and Error Budgets
  • Experience with APM and telemetry tools
  • Experience troubleshooting distributed systems
  • Experience with CICD pipelines in K8S
  • Experience with Datadog monitoring
  • Experience with AWS and Kubernetes using Terraform
  • Experience with infrastructure configuration management
  • Understanding of cloud networking and security

Benefits For Site Reliability Engineer III

Medical Insurance
  • Competitive compensation
  • Comprehensive benefits
  • Career development opportunities
  • Work-life balance

Related Jobs