Site Reliability Engineer

A platform helping millions of businesses scale with automation and AI, making automation work for everyone.
$61,200 - $79,200
Site Reliability
Mid-Level Software Engineer
Remote
2+ years of experience
Enterprise SaaS · AI
This job posting may no longer be active. You may be interested in these related jobs instead:
Site Reliability Developer 2

Site Reliability Developer position at Oracle focusing on cloud infrastructure, automation, and system reliability, requiring 3-5 years of experience with Linux, Python, and cloud technologies.

Software Engineer - Incident Management

Software Engineer position at Datadog focusing on incident management, building tools and processes to improve system reliability and incident response across the organization.

ASE -Site Reliability Engineer

Site Reliability Engineer role at Apple focused on distributed systems and coordination services, offering competitive pay and comprehensive benefits.

Site reliability/Platform Engineer/Sys Dev Engineer, ESC

AWS System Development Engineer position focusing on cloud infrastructure management, combining software development with systems engineering to maintain and improve AWS's global network infrastructure.

Site Reliability Engineer, ESC Managed Operations

AWS seeks Site Reliability Engineer for European Sovereign Cloud launch, focusing on high-availability services and operations management with strong emphasis on security and performance.

Description For Site Reliability Engineer

Zapier is seeking a Site Reliability Engineer to join their remote-first team, focusing on scaling their automation and AI platform. The role requires 2-5 years of experience in cloud engineering or systems administration, with expertise in cloud platforms like AWS and infrastructure as code tools. You'll be responsible for managing Kubernetes clusters, serverless functions, and implementing site reliability engineering principles.

The position offers an opportunity to work on significant projects like their in-house routing service and Terraform infrastructure optimization. You'll be part of a team that values automation, problem-solving, and building resilient systems. The role involves designing and deploying AWS infrastructure, managing EKS clusters, and building services to handle high-traffic workloads.

Zapier offers a competitive compensation package and embraces a culture of transparency, equity, and remote work. They're committed to diversity and inclusion, welcoming applications from candidates of all backgrounds. The company's mission is to make automation accessible to everyone, and as a Site Reliability Engineer, you'll play a crucial role in maintaining and improving their infrastructure reliability.

Working at Zapier means joining a team that believes in the power of automation to transform businesses. You'll collaborate with talented engineers, use cutting-edge tools, and have the flexibility of remote work. The company provides a supportive environment for growth and values effective communication and knowledge sharing. If you're passionate about cloud engineering, automation, and building reliable systems at scale, this role offers an excellent opportunity to make a significant impact.

Last updated a month ago

Responsibilities For Site Reliability Engineer

  • Design and deploy AWS infrastructure using Terraform and Helm
  • Manage and govern Kubernetes clusters (EKS) and serverless functions (Lambda)
  • Evaluate and recommend new infrastructure tools and technologies
  • Partner with teams to solve infrastructure and design problems
  • Build services to integrate systems and process high-traffic workloads
  • Apply site reliability engineering principles
  • Build new features and services
  • Participate in incident response
  • Participate in business hours on-call support

Requirements For Site Reliability Engineer

Python
Go
Kubernetes
Linux
  • 2-5 years of experience in cloud engineering, systems administration, or related field
  • Experience with cloud platforms (AWS, GCP, or Azure)
  • Proficiency in Python, Go, or similar programming languages
  • Experience with infrastructure as code tools
  • Strong problem-solving skills
  • Effective communication skills
  • Knowledge of reliability and observability practices

Benefits For Site Reliability Engineer

  • Competitive pay in technology sector
  • Remote work flexibility
  • Equitable pay practices

Interested in this job?