Cloud Site Reliability Engineer II

SaaS product and pricing platform provider that simplifies core modernization for top banks worldwide.
Site Reliability
Staff Software Engineer
Hybrid
12+ years of experience
Finance · Enterprise SaaS

Description For Cloud Site Reliability Engineer II

Zafin, a leading SaaS product and pricing platform provider for banks worldwide, is seeking a Cloud Site Reliability Engineer II to join its team in Toronto. This advanced role calls for a seasoned professional with 12+ years of experience to lead strategic initiatives that ensure the reliability, scalability, and performance of cloud infrastructure and applications. The position offers an opportunity to work with major global banks while driving innovative solutions and operational excellence.

The role demands expertise in Microsoft Azure, container orchestration systems like AKS and OpenShift, and advanced monitoring solutions. You'll be responsible for architecting cloud infrastructure, managing high-severity incidents, and mentoring junior engineers. The position combines technical leadership with strategic planning, making it ideal for someone who wants to influence cloud reliability strategies while working with cutting-edge technologies.

Zafin offers a competitive compensation package that includes annual bonuses, comprehensive benefits, and significant professional growth opportunities. The company is recognized as a top employer and is a certified Great Place to Work® in multiple countries. Working in a hybrid model, you'll be part of a diverse, collaborative team that values innovation and quality work.

The company's impressive client roster includes major financial institutions like ING, CIBC, HSBC, Wells Fargo, PNC, and ANZ. This role provides an excellent opportunity to work on solutions that help banks accelerate their time to market for new products while enabling personalized pricing and dynamic responses to market needs.

Last updated 10 hours ago

Responsibilities For Cloud Site Reliability Engineer II

  • Lead and manage the resolution of complex technical issues in Zafin's products and Azure cloud environment
  • Design and implement strategic operational enhancements for resiliency and system reliability
  • Conduct Root Cause Analysis (RCA) for high-severity incidents
  • Represent the organization on external client escalation calls
  • Architect and optimize cloud infrastructure
  • Manage and scale container orchestration platforms (AKS and OpenShift)
  • Implement advanced monitoring solutions
  • Develop and execute automation strategies
  • Create and maintain cloud architecture documentation
  • Mentor junior engineers
  • Drive strategic initiatives with cross-functional teams

Requirements For Cloud Site Reliability Engineer II

Skills: Python, Kubernetes
  • Bachelor's degree in Computer Science, Engineering, or a related field (Master's preferred)
  • 12+ years of experience in cloud support, operations, or a related role
  • Advanced expertise in Microsoft Azure or equivalent cloud platforms
  • Experience in designing and scaling container orchestration systems
  • Proven leadership in managing automated deployment pipelines
  • Mastery of enterprise monitoring platforms
  • Advanced scripting skills with PowerShell, Python, or similar languages
  • Extensive experience in incident management
  • In-depth knowledge of database management, particularly Postgres

Benefits For Cloud Site Reliability Engineer II

  • Medical insurance
  • Annual bonus potential
  • Generous paid time off
  • Paid volunteering days
  • Wellness benefits
  • Professional growth opportunities

Jobs Related To Zafin Cloud Site Reliability Engineer II

Cloud Site Reliability Engineer I

Cloud Site Reliability Engineer I position at Zafin, responsible for ensuring seamless operation of cloud infrastructure and applications.

Senior Site Reliability Engineer

Senior Site Reliability Engineer position at Microsoft Security, focusing on building and managing critical infrastructure for red team operations with emphasis on security and automation.

Lead Site Reliability Engineer (Product SRE)

Lead Site Reliability Engineer position at Xero, focusing on driving reliability, observability, and high-performing services across product teams.

Staff Site Reliability Engineer

Staff Site Reliability Engineer position at Assured, offering $180K-$210K with equity, focusing on building and scaling infrastructure for insurance claims processing platform.