Site Reliability Engineer

Kong

Cloud native API platform provider with the fastest, most adopted API gateway in the world, enabling companies to become API-first and securely accelerate AI adoption.

Singapore

Site Reliability

Mid-Level Software Engineer

Hybrid

3+ years of experience

Enterprise SaaS

This job posting may no longer be active. You may be interested in these related jobs instead:

Description For Site Reliability Engineer

Kong is seeking a Site Reliability Engineer to join their dynamic team in Singapore. As a leading cloud native API platform provider with over 300 million downloads of their API gateway, Kong is at the forefront of enabling companies to become API-first and securely accelerate AI adoption.

The role focuses on building and maintaining critical infrastructure automation workflows, managing deployment processes, and ensuring the reliability of Kong's distributed systems. You'll work with cutting-edge technologies including Kubernetes, Docker, and major cloud platforms (AWS, GCP, Azure), while contributing to the company's mission of building the nervous system that will safely and reliably connect all of humankind.

The ideal candidate will bring strong technical expertise in Linux systems, programming languages like Go or Python, and extensive experience with CI/CD pipelines and Infrastructure as Code. You'll be responsible for improving staging and production environments, implementing monitoring solutions, and ensuring the smooth operation of Kong's services in a 24/7/365 environment.

This is an excellent opportunity for a skilled engineer who wants to work with modern cloud technologies and contribute to a company that powers 83% of today's web traffic through API calls. Kong offers a collaborative environment and values diversity, providing equal opportunities to all qualified applicants.

Last updated 4 months ago

Responsibilities For Site Reliability Engineer

Contributing to a team building and maintaining workflows that automate releasing, testing and deploying products
Improving staging and production environments for SaaS distributions
Research and build monitoring/analyze tools to optimize building and deploying code-base
Manage distributed systems and application resources
Build workflows for delivering software to various platforms including AWS, GCP, Azure
Work with container technologies like Docker and Kubernetes to automate deployment and scale products

Requirements For Site Reliability Engineer

Kafka

Kubernetes

PostgreSQL

Redis

BS degree in Computer Science, similar technical field of study or equivalent practical experience
Experience with continuous/rapid release engineering (CI/CD)
Experience with Infrastructure as Code configuration management systems (Terraform, Chef, Puppet or Ansible)
Experience with Apache Kafka
Experience building and administering alerting and monitoring systems for API services
Strong knowledge of Linux/Unix systems
Knowledge of one or more mainstream programming languages (Go, C/C++, Python)
Strong skills in network services such as DNS, TLS/SSL, HTTP
Experience working in a 24/7/365 service environment
Debugging Kubernetes clusters, issues, and networking problems
Design, implement, manage and orchestrate Kubernetes container clusters