Senior Site Reliability Engineer - Cisco ThousandEyes

Cisco

Cisco is a global technology leader that designs and sells networking, security, and communications technology solutions.

2780 Oeiras, Portugal

Site Reliability

Senior Software Engineer

Hybrid

5,000+ Employees

5+ years of experience

Enterprise SaaS

This job posting may no longer be active. You may be interested in these related jobs instead:

Description For Senior Site Reliability Engineer - Cisco ThousandEyes

Cisco ThousandEyes is seeking a Senior Site Reliability Engineer to join their Production Engineering team. ThousandEyes is a leading Digital Experience Assurance platform that helps organizations deliver seamless digital experiences across networks. The role focuses on designing and managing large-scale, highly available distributed systems in the cloud, working directly with application development teams to enhance platform reliability, performance, and security.

The ideal candidate will have expert-level knowledge of Kubernetes and its ecosystem, strong proficiency in Python or Go programming, and deep understanding of cloud providers (especially AWS). They will be responsible for identifying and solving operational excellence challenges, implementing scalable solutions, and maintaining a growing infrastructure with emphasis on automation and code-driven operations.

This is a hybrid position based in Oeiras, Portugal, requiring one day per week in the office. The role involves participating in 24x7 incident response, working with cloud-native tools like Prometheus, Istio, and ArgoCD, and collaborating closely with development teams to optimize service architecture for availability and performance.

The position offers comprehensive benefits including medical, dental, and vision insurance, 401k with company match, disability coverage, and various time-off benefits. Cisco values diverse perspectives and encourages applications from candidates with varied backgrounds, emphasizing potential over traditional qualifications.

Working at Cisco ThousandEyes means joining a team at the forefront of network monitoring and digital experience assurance, with opportunities to work on challenging technical problems at scale while contributing to a product that helps organizations maintain reliable digital services.

Last updated 2 months ago

Responsibilities For Senior Site Reliability Engineer - Cisco ThousandEyes

Identify and provide solutions to common obstacles hindering operational excellence across engineering teams
Partner with application developers using cloud-native tools to address challenges around scale, performance, and reliability
Generalize and standardize solutions and processes across microservice-based multi-region platform
Manage rapidly growing infrastructure with emphasis on operations/infrastructure/everything as code
Participate in 24x7 incident response and on-call rotation
Design and implement scalable operations tooling
Design, deploy, and maintain AWS cloud-native services
Develop automation solutions for service and platform operations

Requirements For Senior Site Reliability Engineer - Cisco ThousandEyes

Kubernetes

Python

Linux

Expert-level knowledge of Kubernetes and its ecosystem
Proficiency in software development with Python or Go
In-depth knowledge of cloud providers, preferably AWS
Strong understanding of Unix/Linux systems
Knowledge of Site Reliability principles
Excellent communication and documentation skills
Strong sense of ownership and attention to detail
5+ years of experience in a related role

Benefits For Senior Site Reliability Engineer - Cisco ThousandEyes

Medical Insurance

Dental Insurance

Vision Insurance

401k

Medical insurance
Dental insurance
Vision insurance
401k with company match
Short and long-term disability coverage
Basic life insurance
Paid holidays
Vacation time
Sick time off
Volunteer time off