Site Reliability Engineer II - Guidewire Cloud Platform (Application)

Guidewire delivers software for Property and Casualty insurance companies, providing core applications for policy management, claims settlement, and customer billing.
Curitiba, State of Paraná, Brazil
Site Reliability
Mid-Level Software Engineer
Hybrid
3+ years of experience
Enterprise SaaS · Finance

Description For Site Reliability Engineer II - Guidewire Cloud Platform (Application)

Guidewire is seeking a Site Reliability Engineer II to join their cloud platform team, focusing on ensuring the reliability and performance of their insurance software solutions. The role combines software engineering with operational expertise to support Guidewire's cloud-based insurance platform.

As an SRE, you'll be responsible for maintaining and improving the reliability of applications running on the Guidewire Cloud Platform. This involves troubleshooting complex systems, developing automated solutions, and working closely with development teams to optimize performance. The position requires participation in on-call rotations to ensure 24/7 service reliability.

Guidewire's platform is crucial for Property and Casualty (P&C) insurance companies worldwide, handling billions of dollars in business. The company's mission is to provide essential tools and technology that help insurers protect and support their customers during critical times, including natural disasters, accidents, and cyber risks.

The ideal candidate will bring strong technical skills in Linux administration, cloud technologies (particularly AWS), and programming languages like Python, Go, or Java. Experience with monitoring tools, CICD pipelines, and infrastructure automation is essential. The role offers opportunities for growth, learning cutting-edge technologies, and making a real impact in the insurance industry.

Working at Guidewire means joining a mission-driven company with a culture that values innovation, teamwork, and work-life balance. The company offers competitive compensation, comprehensive benefits, and career development opportunities while working with talented peers on technology that makes a difference in people's lives.

Last updated 6 days ago

Responsibilities For Site Reliability Engineer II - Guidewire Cloud Platform (Application)

  • Assist in troubleshooting and resolving issues in collaboration with development teams
  • Develop and maintain automated runbooks for proactive issue resolution
  • Monitor applications and improve reliability and performance on the Guidewire Cloud Platform
  • Optimize systems and reduce manual tasks using software engineering skills
  • Document incidents and refine processes to prevent future occurrences
  • Participate in on-call rotations
  • Apply engineering principles and automation to enhance operating environments

Requirements For Site Reliability Engineer II - Guidewire Cloud Platform (Application)

Python
Go
Java
Linux
Kubernetes
  • Experience as an SRE or similar role
  • Strong problem-solving skills
  • Linux system administration skills
  • Programming/scripting skills in Python, Go, Java, or shell
  • Understanding of SLIs, SLOs, and Error Budgets
  • Experience with APM and telemetry tools
  • Experience with troubleshooting distributed systems on cloud infrastructure
  • Experience with CICD pipelines within K8S
  • Experience with Datadog monitoring tools
  • Experience with AWS or Kubernetes using Terraform
  • Knowledge of infrastructure configuration management (GitOps, Puppet, or Ansible)
  • Understanding of AWS cloud networking and security

Interested in this job?

Jobs Related To Guidewire Site Reliability Engineer II - Guidewire Cloud Platform (Application)

Software Developer III, Site Reliability Development, Google Cloud

Site Reliability Developer role at Google Cloud, focusing on building and maintaining large-scale distributed systems with emphasis on reliability and performance.

Software Developer II, Site Reliability Development, Google Cloud

Site Reliability Development Engineer position at Google Cloud, focusing on building and maintaining large-scale distributed systems with emphasis on reliability and performance.

Site Reliability Engineering, Transformative Compute Site Reliability Engineering

Site Reliability Engineer position at Google focusing on building and maintaining large-scale distributed systems for Google Cloud services.

Databases Site Reliability Engineer

Site Reliability Engineer position at Google focusing on database systems, requiring expertise in distributed systems, programming, and Linux/Unix administration.

Systems Engineer III, Site Reliability Engineering

Systems Engineer III position at Google focusing on Site Reliability Engineering, building and maintaining large-scale distributed systems with 2+ years of experience required.