Site Reliability Engineer

Renmoney is a financial technology company providing innovative solutions in Nigeria.
Lagos, Nigeria
DevOps
Mid-Level Software Engineer
Hybrid
3+ years of experience

Description For Site Reliability Engineer

Renmoney is seeking a Site Reliability Engineer to join their IT department in a hybrid work environment. This role focuses on ensuring the availability and reliability of UAT and production applications, as well as improving the entire lifecycle of services. The ideal candidate will have experience with databases, configuration management tools, containerization, and monitoring systems. They will be responsible for troubleshooting complex issues, implementing security measures, and scaling systems through automation. This position offers the opportunity to work with cutting-edge technologies and solve real-world challenges in a dynamic fintech environment. The successful candidate will join a team of amazing people in a flat organizational structure, contributing to the growth and success of Renmoney's digital infrastructure.

Last updated a year ago

Responsibilities For Site Reliability Engineer

  • Ensure availability of UAT and production applications and support capacity planning for production infrastructure
  • Monitor existing systems and applications using monitoring tools
  • Engage in and improve the whole lifecycle of services, from inception and design through deployment and operations
  • Troubleshoot problems that span systems, databases, storage, network, and code
  • Suggest and implement security measures for the protection of systems, networks, and information
  • Scale systems sustainably through mechanisms like automation
  • Evolve systems by pushing for changes that improve reliability and velocity
  • Minimize and mitigate the risk of reliability-related failures pertaining to systems availability, performance, and correctness
  • Investigate warnings and alerts from monitoring systems
  • Respond to, diagnose, and follow up on system outages
  • Document processes and maintain procedure manuals

Requirements For Site Reliability Engineer

Kubernetes
Kafka
Redis
Python
Go
Ruby
Linux
  • Working knowledge of databases and SQL
  • Minimum of 3 years' work experience
  • Comfortable with open-source configuration management and orchestration tools (Chef, Puppet, Ansible, Terraform, etc.)
  • Knowledge of Docker, Docker Swarm, Fargate, and Kubernetes
  • Experience with caching and messaging systems such as Redis and Kafka
  • Experience building monitoring tools and defining measurement metrics
  • Proficiency with shell and a programming language used in an SRE/Operations engineering context (Python, Go, Ruby, etc.)
  • Experience operating in a high-availability environment
  • Excellent communication skills with a high level of emotional intelligence
  • Experience working with remote teams
  • Server administration skills (Red Hat, Windows, CentOS, Ubuntu)

Benefits For Site Reliability Engineer

  • Competitive compensation
  • Work with amazing people
  • Work in a beautiful environment
  • Flat structure
  • Solve complex, real-world challenges
