Site Reliability Engineer (SRE)

Cairo, Cairo Governorate, Egypt; Alexandria, Alexandria Governorate, Egypt; Riyadh, Saudi Arabia
Site Reliability
Mid-Level Software Engineer
Remote
3+ years of experience
Enterprise SaaS
This job posting may no longer be active.

Description For Site Reliability Engineer (SRE)

Lucidya is seeking a Site Reliability Engineer (SRE) to join its Cloud Engineering team. The role focuses on improving the reliability, scalability, and automation of cloud-based infrastructure, working with cloud environments, containerized workloads, and monitoring systems. Key responsibilities include managing high-availability infrastructure, cloud operations, and Kubernetes clusters, and implementing monitoring solutions. The role requires expertise in cloud platforms, Infrastructure as Code, and automation tools.

The position is remote, with opportunities in multiple locations across Egypt and Saudi Arabia. It suits a mid-level engineer with around 3 years of experience who wants to make a significant impact on infrastructure reliability and system performance. The role combines technical expertise with collaborative teamwork, making it a good fit for someone passionate about infrastructure automation and system reliability.

Last updated 2 months ago

Responsibilities For Site Reliability Engineer (SRE)

  • Ensure high availability and scalability of critical infrastructure components
  • Proactively identify and eliminate single points of failure
  • Handle Linux systems administration tasks
  • Manage and optimize cloud-based workloads
  • Automate provisioning, scaling, and maintenance tasks
  • Manage Kubernetes cluster operations
  • Implement and standardize monitoring solutions
  • Participate in on-call rotations and troubleshoot incidents
  • Develop and maintain automation scripts
  • Work with DevOps and Engineering teams to resolve performance issues

Requirements For Site Reliability Engineer (SRE)

Python
Kubernetes
Linux
  • 3 years of experience in an SRE, DevOps, or Infrastructure Engineer role
  • Strong experience with a major cloud provider (AWS, GCP, or Azure)
  • Hands-on experience with Kubernetes and containerization
  • Proficient in scripting languages (Python, Bash)
  • Familiarity with Infrastructure as Code tools
  • Strong understanding of load balancers and networking
  • Experience with CI/CD tools
  • Experience with monitoring and observability tools
  • Strong analytical and problem-solving skills
  • Excellent communication and collaboration skills
