Taro Logo

Customer Reliability Engineer, Infrastructure

Company behind Astro, the industry-leading unified DataOps platform powered by Apache Airflow®, empowering data teams with mission-critical software, analytics, and AI.
$130,000 - $150,000
DevOps
Senior Software Engineer
Remote
4+ years of experience
Enterprise SaaS · AI
This job posting may no longer be active. You may be interested in these related jobs instead:

Description For Customer Reliability Engineer, Infrastructure

Astronomer, the company behind the industry-leading Astro DataOps platform powered by Apache Airflow®, is seeking a Customer Reliability Engineer specializing in infrastructure. This role is perfect for technology enthusiasts who strive for expertise and excellence in their field.

As a CRE at Astronomer, you'll be at the forefront of managing and optimizing our managed Airflow service across AWS, Azure, and GCP. The position requires deep technical knowledge of Kubernetes and cloud infrastructure, combined with strong customer-facing skills. You'll work in a remote-first environment with a competitive salary range of $130,000 - $150,000 plus equity.

The role offers unique opportunities to work with cutting-edge cloud-native technologies while directly impacting customer success. You'll split your time between core platform reliability work and innovative side projects, including contributions to open-source Airflow. The position requires working from 9AM-3PM Eastern US time, with flexible remaining hours, and includes participation in on-call rotations.

What makes this role special is its blend of technical depth and customer interaction. You'll not only master complex infrastructure systems but also build strong relationships with customers, helping them achieve their reliability goals. The role offers extensive learning opportunities across multiple disciplines, including Kubernetes, cloud engineering, and networking.

Astronomer values diverse experiences and unconventional career paths. The company serves over 700 leading enterprises, providing a platform that accelerates building reliable data products for insights, AI value, and data-driven applications. This role offers the chance to be part of a growing company while working with sophisticated, cloud-native technology that connects to dozens of systems.

The ideal candidate brings 4 years of professional experience, strong Linux and container expertise, and excellent communication skills. Experience with major cloud providers is essential, while DevOps background and open-source contributions are valuable bonuses. Join a team that values diversity, equal opportunity, and the power of remote collaboration.

Last updated 2 months ago

Responsibilities For Customer Reliability Engineer, Infrastructure

  • Operate, monitor, and maintain the platform to ensure availability, predictability, and reliable operations
  • Become an expert on the reliability of Kubernetes and underlying cloud infrastructure
  • Create strong relationships with customers and help them achieve reliability goals
  • Provide feedback to shape product direction
  • Own customer experience and meet SLAs
  • Participate in 24x7 coverage through specified 6-hour pager period
  • Participate in paid on-call rotation for weekend coverage
  • Work directly with customers' data engineers, system admins, DevOps teams, and management

Requirements For Customer Reliability Engineer, Infrastructure

Kubernetes
Linux
  • 4 years of professional experience
  • Experience with Kubernetes/Docker/Containers
  • Experience with any major cloud provider (AWS, GCP, Azure)
  • Demonstrable Linux familiarity
  • Excellent written and verbal communication skills
  • Problem-solving and troubleshooting abilities
  • Commitment to excellence
  • Motivation to learn

Benefits For Customer Reliability Engineer, Infrastructure

Equity
  • Remote-first company
  • Equity component
  • 2-4 in-person events per year

Interested in this job?