Taro Logo

Site Reliability Engineer

A travel technology company rebuilding the infrastructure that underpins the travel industry, backed by Y Combinator, Benchmark, Blossom, Index Ventures and Kima Ventures.
Site Reliability
Senior Software Engineer
Hybrid
5+ years of experience
Travel

Job Description

Duffel is revolutionizing the travel industry by building modern infrastructure to simplify travel distribution, search, and booking. As a Site Reliability Engineer, you'll join a dynamic team backed by prestigious investors like Benchmark and Index Ventures. You'll be responsible for maintaining and improving the reliability, performance, and resilience of Duffel's infrastructure and applications.

The role involves working with cutting-edge technologies including GCP, Kubernetes, and OpenTelemetry. You'll be handling critical infrastructure components, managing high-availability metrics collection systems, and overseeing data pipelines. The team is currently focused on improving reliability monitoring and implementing OpenTelemetry with Honeycomb for better production insights.

Future challenges include expanding to multiple regions globally and improving deployment strategies. You'll be working in a collaborative environment using tools like Elixir, Phoenix, and various GCP services. The position offers significant technical challenges, from debugging complex configuration issues to architecting multi-regional infrastructure.

As part of the team, you'll contribute to building tools that will make the future of travel effortless, serving over 4 billion airline passengers. The company offers equity ownership and is committed to personal growth, maintaining an inclusive environment that values diverse perspectives and problem-solving abilities.

Last updated 3 months ago

Responsibilities For Site Reliability Engineer

  • Ensure reliability, performance, and resilience of infrastructure and applications
  • Work closely with engineering teams to understand and meet their needs
  • Manage infrastructure on Google Cloud Platform
  • Maintain PCI Cardholder Data Environment
  • Manage infrastructure using Terraform and GitOps
  • Oversee metrics collection system and telemetry
  • Handle data pipeline management
  • Drive reliability monitoring improvements across engineering

Requirements For Site Reliability Engineer

Kubernetes
PostgreSQL
Redis
  • Infrastructure and systems engineering generalist experience
  • Software development and systems engineering skills
  • High standards for code and configuration quality
  • Good understanding of observability and reliability practices
  • Experience in incident response
  • Strong communication skills
  • Collaborative mindset
  • Experience with cloud platforms (preferably GCP)

Benefits For Site Reliability Engineer

Equity
  • Company equity ownership
  • Focus on personal growth
  • Inclusive work environment

Related Jobs

Senior Software Engineer SRE

Senior SRE position at Spire Global focusing on maintaining and improving reliability of satellite constellation operations through software automation and monitoring.

Site Reliability Engineer

Senior Site Reliability Engineer position at LexisNexis IP, focusing on cloud infrastructure, Kubernetes, and DevOps practices in Farringdon, UK.

Site Reliability Engineer

Senior Site Reliability Engineer position at Gizmo in London, focusing on scaling systems for millions of users with hybrid work arrangement.

Senior Site Reliability Engineer — AI Studio (Inference Platform)

Senior SRE position at Nebius focusing on AI infrastructure, requiring expertise in Kubernetes, observability, and GPU optimization for large-scale inference platforms.

Software Engineer, Backend

Senior Backend Software Engineer role at Duffel, building APIs to revolutionize travel booking. 6+ years experience required. Based in London. £80K-£90K.