Taro Logo

Staff Site Reliability Engineer

A leading restaurant management ecosystem and payment tech provider founded in 2014, processing over 6 billion orders across 35+ countries.
Site Reliability
Staff Software Engineer
In-Person
501 - 1,000 Employees
8+ years of experience
Enterprise SaaS · Finance

Job Description

Foodics, a leading restaurant management and payment tech provider, is seeking a Staff Site Reliability Engineer to join their high-impact engineering team. Founded in 2014 and headquartered in Riyadh, Foodics has successfully processed over 6 billion orders across 35+ countries, making it one of MENA's fastest-growing SaaS companies with $170M in funding.

The role focuses on ensuring the scalability, performance, and reliability of Foodics' cloud-native platforms. You'll be responsible for designing and implementing infrastructure solutions, managing incident responses, and establishing best practices in observability and resilience engineering. The position requires expertise in Kubernetes, cloud providers (AWS, OCI), infrastructure as code, and monitoring systems.

Key responsibilities include maintaining highly available systems, leading incident responses, implementing observability frameworks, and driving chaos engineering experiments. The ideal candidate should have strong SRE principles knowledge, hands-on experience with distributed systems, and excellent troubleshooting skills.

Working at Foodics offers competitive compensation with equity potential, professional development opportunities, and the chance to work with a diverse global team. You'll be at the forefront of cloud technologies while directly impacting platforms serving millions of users. The role combines technical challenges with business impact, making it perfect for engineers passionate about reliability and scalability in high-growth environments.

Last updated 18 days ago

Responsibilities For Staff Site Reliability Engineer

  • Design and maintain scalable, highly available, and fault-tolerant systems across multiple cloud providers (AWS, OCI)
  • Lead incident response efforts, conducting blameless post-mortems and driving systemic improvements
  • Build and refine automated deployment pipelines
  • Implement robust observability frameworks
  • Collaborate with development teams to embed reliability
  • Optimize infrastructure costs while maintaining service quality
  • Drive chaos engineering experiments
  • Document architecture, runbooks, and operational processes

Requirements For Staff Site Reliability Engineer

Kubernetes
Python
Go
MySQL
PostgreSQL
MongoDB
Redis
  • Strong background in SRE principles (SLIs, SLOs, SLAs) and operational excellence
  • Experience with Kubernetes, container orchestration, and service mesh technologies
  • Expertise in infrastructure as code and automation scripting
  • Deep understanding of monitoring and alerting systems
  • Skilled in cloud networking, load balancing, API gateway management
  • Solid experience with relational and NoSQL databases in production
  • Familiarity with distributed tracing and chaos testing frameworks
  • Excellent troubleshooting skills and ability to resolve high-impact incidents under pressure

Benefits For Staff Site Reliability Engineer

Equity
Education Budget
  • Highly competitive compensation packages including bonuses and potential equity
  • Annual learning stipend and regular training
  • Exposure to cutting-edge cloud technologies
  • Global team of over 30 nationalities in 14 countries
  • Autonomy and challenging goals

Related Jobs