Taro Logo

Site Reliability Engineer

Talent Worx is an emerging recruitment firm providing financial technology solutions to financial institutions, businesses, and developers.
Site Reliability
Senior Software Engineer
In-Person
5+ years of experience
Finance

Description For Site Reliability Engineer

Talent Worx, an emerging recruitment firm specializing in financial technology solutions, is seeking a Site Reliability Engineer to join their team. This role is critical in driving innovation and growth for Banking Solutions, Payments, and Capital Markets business. The position requires 5-8 years of experience and offers opportunities to make a lasting impact on the company's transformation journey.

The ideal candidate will be responsible for designing and maintaining monitoring solutions, implementing automation tools, ensuring system reliability, and leading incident response efforts. They will work closely with development, QA, DevOps, and product teams to maintain high availability and optimal performance of systems.

Key technical requirements include proficiency in cloud platforms (AWS, Azure, Google Cloud), monitoring tools (Prometheus, Grafana, DataDog), and scripting languages (Python, Bash). The role demands expertise in CI/CD pipelines, disaster recovery planning, and security best practices.

This position offers a chance to work with cutting-edge technologies while solving complex challenges in the financial technology sector. The ideal candidate should be action-oriented, have excellent interpersonal skills, and demonstrate strong attention to detail. They should also embody the firm's values of winning as one team, leading with integrity, and being the change.

The role is based in either Bengaluru or Chennai, India, requiring on-site presence. This is an excellent opportunity for an experienced SRE professional looking to make a significant impact in the fintech industry while working with a diverse range of technologies and stakeholders.

Last updated 4 days ago

Responsibilities For Site Reliability Engineer

  • Design and maintain monitoring solutions and alerting mechanisms for infrastructure
  • Implement automation tools and processes
  • Ensure reliability, availability, and performance of applications and services
  • Lead incident response efforts
  • Conduct capacity planning and performance tuning
  • Collaborate with security teams
  • Manage deployment pipelines and release processes
  • Create and maintain documentation and runbooks
  • Develop and test disaster recovery plans
  • Participate in on-call rotations
  • Collaborate with development, QA, DevOps, and product teams

Requirements For Site Reliability Engineer

Python
Linux
Kubernetes
  • Proficient in development technologies, architectures, and platforms
  • Experience in cloud platforms (AWS, Azure, Google Cloud)
  • Knowledge of monitoring tools (Prometheus, Grafana, DataDog, New Relic)
  • Experience in incident management
  • Strong troubleshooting skills
  • Proficiency in scripting languages (Python, Bash)
  • Experience in implementing CI/CD pipelines
  • Expertise in setting up monitoring solutions
  • Familiarity with APM tools
  • Familiarity with RUM (Real User Monitoring)
  • 5-8 years of experience

Jobs Related To Talent Worx Site Reliability Engineer