Taro Logo

Site Reliability Engineer II

A fintech unicorn founded in 2015 providing the world's first Omni Stack for banks and fintechs, offering modern software stack for retail banking.
Site Reliability
Mid-Level Software Engineer
In-Person
501 - 1,000 Employees
2+ years of experience
Finance · Enterprise SaaS

Job Description

Zeta, a pioneering fintech unicorn founded in 2015, is seeking a Site Reliability Engineer II to join their team in Bangalore. As the world's first Omni Stack provider for banks and fintechs, Zeta is revolutionizing the financial technology sector with their comprehensive software solutions for retail banking.

The role demands a skilled professional with 2-4 years of experience in site reliability engineering, who will be responsible for ensuring the reliability and performance of Zeta's critical systems. The position offers an opportunity to work with cutting-edge technologies including Kubernetes, Docker, and modern cloud platforms, while implementing infrastructure as code and automation practices.

As an SRE II, you'll be at the forefront of maintaining and improving the infrastructure that powers financial services for major clients like Sodexo and HDFC Bank. Your responsibilities will span from system reliability and automation to incident response and disaster recovery planning. The role requires a strong technical foundation in programming (Python, Go), containerization, and cloud technologies, combined with a proactive approach to system optimization and problem-solving.

The ideal candidate will have a B.Tech/M.Tech in computer science or related field, with demonstrated experience in infrastructure automation, monitoring solutions, and security best practices. You'll be part of an on-call rotation, ensuring 24/7 system availability for Zeta's global operations across 8 countries.

This is an excellent opportunity for a mid-level SRE to join a high-growth startup that's reshaping the future of banking technology. You'll work in an environment that values technical excellence, continuous improvement, and innovation, while being part of a company that celebrates diversity and creates an inclusive workplace for all employees.

Last updated a month ago

Responsibilities For Site Reliability Engineer II

  • Ensuring the reliability of software systems by designing, implementing, and maintaining scalable infrastructure
  • Developing automation tools and scripts to streamline operational tasks
  • Monitoring system performance and responding to incidents
  • Analyzing system usage patterns and forecasting future capacity needs
  • Identifying and addressing performance bottlenecks
  • Implementing infrastructure as code practices
  • Implementing and maintaining monitoring and logging solutions
  • Participating in on-call rotation for 24/7 system availability
  • Collaborating with security teams to implement security best practices
  • Developing and maintaining disaster recovery plans
  • Continuously analyzing system performance for improvements

Requirements For Site Reliability Engineer II

Python
Go
Kubernetes
  • 2-4 years of experience in site reliability engineering
  • B.Tech/M.Tech in computer science, information technology or related field
  • Proficiency in Python, Go, Shell, Bash
  • Experience with automation tools (Ansible, Puppet, Chef)
  • Knowledge of Infrastructure as Code tools like Terraform
  • Experience with Docker and Kubernetes
  • Proficiency in cloud platforms (AWS, Azure, or GCP)
  • Familiarity with monitoring tools (Prometheus, Grafana, ELK stack)
  • Understanding of networking concepts and protocols
  • Knowledge of security best practices
  • Experience with CI/CD pipelines
  • Proficient in Git version control

Related Jobs

Site Reliability Engineer II

Site Reliability Engineer II position at Zeta, focusing on maintaining and improving infrastructure reliability and automation in a fintech environment.

Site Reliability Engineer - II

Adobe is hiring a Site Reliability Engineer II in Bangalore to build and maintain scalable, reliable cloud infrastructure and services, requiring 4-8 years of experience with Linux, cloud platforms, and DevOps practices.

Site Reliability Developer

Site Reliability Developer position at Oracle focusing on cloud infrastructure, automation, and service reliability with 3-5+ years experience required.

Systems Engineer III, Site Reliability Engineering

Systems Engineer III position at Google focusing on Site Reliability Engineering, maintaining and improving large-scale distributed systems and enterprise applications.

Systems Engineer III, Site Reliability Engineering

Systems Engineer III position in Google's Site Reliability Engineering team, focusing on building and maintaining large-scale distributed systems with emphasis on reliability and automation.