Site Reliability Engineer

Cloud native API platform provider with the fastest, most adopted API gateway in the world, enabling companies to become API-first and securely accelerate AI adoption.
Site Reliability
Mid-Level Software Engineer
Hybrid
3+ years of experience
Enterprise SaaS
This job posting may no longer be active.

Description For Site Reliability Engineer

Kong is seeking a Site Reliability Engineer to join its team in Singapore. As a leading cloud native API platform provider with over 300 million downloads of its API gateway, Kong is at the forefront of enabling companies to become API-first and securely accelerate AI adoption.

The role focuses on building and maintaining critical infrastructure automation workflows, managing deployment processes, and ensuring the reliability of Kong's distributed systems. You'll work with cutting-edge technologies including Kubernetes, Docker, and major cloud platforms (AWS, GCP, Azure), while contributing to the company's mission of building the nervous system that will safely and reliably connect all of humankind.

The ideal candidate will bring strong technical expertise in Linux systems, programming languages like Go or Python, and extensive experience with CI/CD pipelines and Infrastructure as Code. You'll be responsible for improving staging and production environments, implementing monitoring solutions, and ensuring the smooth operation of Kong's services in a 24/7/365 environment.

This is an excellent opportunity for a skilled engineer who wants to work with modern cloud technologies and contribute to a company serving a market in which API calls are said to account for 83% of today's web traffic. Kong offers a collaborative environment and values diversity, providing equal opportunities to all qualified applicants.

Last updated 4 months ago

Responsibilities For Site Reliability Engineer

  • Contribute to a team that builds and maintains workflows to automate releasing, testing and deploying products
  • Improve staging and production environments for SaaS distributions
  • Research and build monitoring and analysis tools to optimize building and deploying the codebase
  • Manage distributed systems and application resources
  • Build workflows for delivering software to various platforms, including AWS, GCP and Azure
  • Work with container technologies like Docker and Kubernetes to automate deployment and scale products

Requirements For Site Reliability Engineer

Go, Kafka, Kubernetes, PostgreSQL, Redis
  • BS degree in Computer Science or a similar technical field of study, or equivalent practical experience
  • Experience with continuous/rapid release engineering (CI/CD)
  • Experience with Infrastructure as Code and configuration management systems (Terraform, Chef, Puppet or Ansible)
  • Experience with Apache Kafka
  • Experience building and administering alerting and monitoring systems for API services
  • Strong knowledge of Linux/Unix systems
  • Knowledge of one or more mainstream programming languages (Go, C/C++, Python)
  • Strong skills in network services such as DNS, TLS/SSL, HTTP
  • Experience working in a 24/7/365 service environment
  • Experience debugging Kubernetes clusters, including networking problems
  • Ability to design, implement, manage and orchestrate Kubernetes container clusters