Systems Reliability Engineer (SRE) - Edge

Cloudflare

Cloudflare runs one of the world's largest networks powering millions of websites, protecting and accelerating Internet applications without hardware or software changes.

Austin, TX, USA

Site Reliability

Mid-Level Software Engineer

Hybrid

5,000+ Employees

3+ years of experience

Enterprise SaaS · Cybersecurity

Job Description

Cloudflare is seeking a Systems Reliability Engineer (SRE) to join their Edge platform team, operating across more than 320 cities in over 120 countries. This role sits at the intersection of systems, network, and software engineering, focusing on maintaining and improving Cloudflare's vast global network infrastructure.

The position requires a strong background in automation, scalability, and operational excellence. Working in a "follow the sun" model across global offices, you'll be responsible for building tools to enhance service availability, performance, and operational efficiency. The role demands a passionate curiosity about Internet fundamentals, combined with strong knowledge of networking, Linux, and TLS, along with coding abilities in languages like Go, Rust, or Python.

As an SRE at Cloudflare, you'll be part of a team that manages the immediate state and functionality of Cloudflare's worldwide platform. You'll work with various monitoring, alerting, and diagnostic tools while continuously improving the platform's capabilities. The role involves owning a wide portfolio of applications and services, maintaining a tight feedback loop between development and operations.

The ideal candidate should have at least 3 years of experience in an SRE role or similar position, with strong Linux systems experience and software development skills. Knowledge of distributed systems, network protocols, and system design trade-offs is essential. Experience with tools like Nginx, PostgreSQL, Docker, Prometheus, and Grafana would be advantageous.

This is an excellent opportunity to join a high-performing team at a company that's helping build a better Internet. Cloudflare's mission extends beyond commercial success, with initiatives like Project Galileo protecting journalism and civil society organizations, and the Athenian Project securing election websites. The company values diversity and inclusiveness, seeking curious and empathetic individuals committed to personal growth and learning.

Last updated a month ago

Responsibilities For Systems Reliability Engineer (SRE) - Edge

Build and operate Edge platform running in more than 320 cities
Improve service availability, performance, and operational velocity
Develop and enhance the Cloudflare platform
Monitor and maintain platform functionality
Build automation tools for system reliability

Requirements For Systems Reliability Engineer (SRE) - Edge

Python

Linux

PostgreSQL

Linux systems experience
3 years experience in an SRE role or similar functions
Software development skills in Go, Rust, or Python
Understanding of distributed software systems
Intermediate experience of common network protocols like DNS and HTTP

Cloudflare

Cloudflare runs one of the world's largest networks powering millions of websites, protecting and accelerating Internet applications without hardware or software changes.

Austin, TX, USA

Site Reliability

Mid-Level Software Engineer

Hybrid

5,000+ Employees

3+ years of experience

Enterprise SaaS · Cybersecurity

Related Jobs

Site Reliability Engineer

Global Payments

Site Reliability Engineer position at Global Payments, focusing on API operations and infrastructure management with hybrid work options in multiple US locations.

Site Reliability Engineer II - CTJ - Top Secret

Microsoft

Site Reliability Engineer II position at Microsoft working on Defender security products for government clouds, requiring Top Secret clearance and offering competitive compensation with comprehensive benefits.

Site Reliability Engineer

Global Payments

Site Reliability Engineer position at Global Payments, focusing on maintaining and improving API operations and system reliability for a leading payment processing company.

Site Reliability Engineer (SRE) – Infrastructure and Observability

Worldpay

Site Reliability Engineer role at Worldpay focusing on infrastructure and observability, requiring 3+ years of experience in SRE/DevOps and expertise in monitoring tools and automation.

Site Reliability Engineer II - CTJ - Poly

Microsoft

Microsoft is hiring a Site Reliability Engineer II for their Identity team to support Azure Government Secret and Top-Secret Clouds, offering hybrid work and comprehensive benefits.