Cloud Site Reliability Engineer II

Zafin offers a SaaS product and pricing platform that simplifies core modernization for top banks worldwide.
Thiruvananthapuram, Kerala, India
DevOps
Staff Software Engineer
In-Person
501 - 1,000 Employees
12+ years of experience
Finance · Enterprise SaaS

Description For Cloud Site Reliability Engineer II

Zafin, a leading SaaS product and pricing platform provider for banks worldwide, is seeking a Cloud Site Reliability Engineer II to join their team in Trivandrum, India. This role represents a unique opportunity to work with a company that serves major global banks including ING, CIBC, HSBC, and Wells Fargo. The position requires 12+ years of experience and deep expertise in cloud technologies, particularly Microsoft Azure.

As a CSRE II, you'll lead strategic initiatives ensuring the reliability, scalability, and performance of cloud infrastructure and applications. You'll be responsible for architecting solutions, managing complex technical issues, and mentoring junior engineers. The role combines technical leadership with hands-on work in container orchestration, automation, and monitoring solutions.

The ideal candidate will bring strong expertise in Azure, container orchestration systems (AKS/OpenShift), and database management (particularly Postgres). You'll need excellent problem-solving abilities and strong leadership skills to succeed in this role. The position offers competitive compensation, including annual bonuses, generous PTO, and comprehensive benefits.

Working at Zafin means joining a certified Great Place to Work® in Canada, India, and the UK, with a culture that values diversity and teamwork. The company's platform helps banks accelerate time to market for new products while lowering change costs and achieving tangible business outcomes. This role offers significant opportunity for professional growth and impact in a rapidly evolving fintech space.

Last updated 4 days ago

Responsibilities For Cloud Site Reliability Engineer II

  • Lead and manage complex technical issues resolution in Zafin's products and Azure cloud environment
  • Design and implement strategic operational enhancements for resiliency and system reliability
  • Conduct Root Cause Analysis (RCA) for high-severity incidents
  • Represent organization in external client escalation calls
  • Architect and optimize cloud infrastructure
  • Manage and scale container orchestration platforms (AKS and OpenShift)
  • Oversee advanced monitoring solutions implementation
  • Develop and execute automation strategies
  • Create and maintain cloud architecture documentation
  • Mentor and coach junior engineers
  • Drive strategic initiatives with cross-functional teams

Requirements For Cloud Site Reliability Engineer II

Python
Kubernetes
PostgreSQL
  • Bachelor's degree in Computer Science, Engineering, or related field (Master's preferred)
  • 12+ years of experience in cloud support, operations, or related role
  • Advanced expertise in Microsoft Azure
  • Experience in designing and scaling container orchestration systems
  • Proven leadership in managing automated deployment pipelines
  • Mastery in enterprise monitoring platforms
  • Advanced scripting skills with PowerShell, Python
  • Extensive experience in incident management
  • In-depth knowledge of database management, particularly Postgres
  • Exceptional analytical and problem-solving abilities
  • Strong leadership and mentoring skills

Benefits For Cloud Site Reliability Engineer II

Medical Insurance
  • Competitive salaries
  • Annual bonus potential
  • Generous paid time off
  • Paid volunteering days
  • Wellness benefits
  • Professional growth opportunities
  • Career advancement opportunities

Interested in this job?

Jobs Related To Zafin Cloud Site Reliability Engineer II

Cloud Site Reliability Engineer II

Lead cloud infrastructure and reliability initiatives at Zafin, implementing strategic solutions for a global banking software platform.

Staff Software Engineer (Developer Productivity)

Staff Software Engineer position at Okta focusing on developer productivity and infrastructure automation, offering competitive salary and comprehensive benefits in Toronto.

System Infrastructure Developer

Senior infrastructure development role at Apple focusing on silicon technology and CAD automation systems.

Senior Compute Site Reliability Engineer (GPU)

Senior SRE position at Apple focusing on GPU infrastructure, offering competitive pay, equity, and comprehensive benefits. Requires 5+ years of experience in SRE/DevOps with GPU expertise.

Sr Network Dev Engineer, Network Provisioning and Automation (Level 6)

Senior Network Development Engineer role at Amazon focusing on network automation and infrastructure provisioning for fulfillment networks, combining DevOps practices with network engineering expertise.