Taro Logo

Site Reliability Engineer II

World's first and only Omni Stack for banks and fintechs, rethinking payments from core to the edge.
Site Reliability
Mid-Level Software Engineer
In-Person
2+ years of experience
Finance

Job Description

Zeta, a unicorn fintech company founded in 2015, is seeking a Site Reliability Engineer II to join their team in Bangalore. As the world's first Omni Stack for banks and fintechs, Zeta is revolutionizing the payment industry from core to edge. This role offers an exciting opportunity to work with cutting-edge technology in a high-growth environment.

The ideal candidate will be responsible for ensuring system reliability through infrastructure design, automation, and monitoring. You'll work with modern tools and technologies including Kubernetes, cloud platforms, and Infrastructure as Code. The role requires a strong background in programming (Python, Go) and DevOps practices, with 2-4 years of relevant experience.

At Zeta, you'll be part of a team serving over 10 banks and 25 fintechs across 8 countries, including major clients like Sodexo and HDFC Bank. The company culture emphasizes personal growth with their core philosophy of 'People Must Grow.' You'll have the opportunity to work with some of the best minds in the industry while contributing to systems that process critical financial transactions.

The position offers hands-on experience with modern DevOps tools, participation in on-call rotations, and the chance to work on large-scale systems. You'll be involved in everything from performance optimization to disaster recovery planning, making this an excellent opportunity for engineers looking to grow their skills in system reliability and automation.

Last updated 6 days ago

Responsibilities For Site Reliability Engineer II

  • Ensuring the reliability of software systems by designing, implementing, and maintaining scalable infrastructure
  • Developing automation tools and scripts to streamline operational tasks
  • Monitoring system performance and responding to incidents
  • Analyzing system usage patterns and forecasting future capacity needs
  • Identifying and addressing performance bottlenecks
  • Implementing infrastructure as code practices
  • Implementing and maintaining monitoring and logging solutions
  • Participating in on-call rotation
  • Collaborating with security teams
  • Developing and maintaining disaster recovery plans
  • Continuously analyzing system performance and implementing improvements

Requirements For Site Reliability Engineer II

Python
Go
Kubernetes
  • 2-4 years of experience in site reliability engineering
  • B.Tech/M.Tech in computer science, information technology or related field
  • Proficiency in Python, Go, Shell, Bash
  • Experience with automation tools (Ansible, Puppet, Chef)
  • Experience with Docker and Kubernetes
  • Proficiency in cloud platforms (AWS, Azure, or Google Cloud)
  • Knowledge of monitoring tools (Prometheus, Grafana, ELK stack)
  • Understanding of networking concepts
  • Knowledge of security best practices
  • Experience with CI/CD pipelines
  • Proficient in Git version control

Related Jobs

Site Reliability Engineer II

Site Reliability Engineer II position at Zeta, focusing on maintaining and improving infrastructure reliability, automation, and system performance.

Site Reliability Engineer - II

Adobe is hiring a Site Reliability Engineer II in Bangalore to build and maintain scalable, reliable cloud infrastructure and services, requiring 4-8 years of experience with Linux, cloud platforms, and DevOps practices.

Site Reliability Developer

Site Reliability Developer position at Oracle focusing on cloud infrastructure, automation, and service reliability with 3-5+ years experience required.

Systems Engineer III, Site Reliability Engineering

Systems Engineer III position at Google focusing on Site Reliability Engineering, maintaining and improving large-scale distributed systems and enterprise applications.

Systems Engineer III, Site Reliability Engineering

Systems Engineer III position in Google's Site Reliability Engineering team, focusing on building and maintaining large-scale distributed systems with emphasis on reliability and automation.