Cisco ThousandEyes, a leading Digital Experience Assurance platform, is seeking a Senior Site Reliability Engineer to join its Production Engineering team in London. The role offers the opportunity to work with cutting-edge cloud technologies and contribute to a platform that is deeply integrated across Cisco's extensive technology portfolio.
In this role, you will be responsible for designing and managing large-scale, highly available distributed systems in the cloud. You'll work directly with application development teams to enhance the reliability, performance, and security of the platform, using modern technologies including Kubernetes, AWS, and various CNCF solutions.
Key responsibilities include optimizing architecture for availability and performance, implementing scalable operations tooling, and participating in incident response. You'll be instrumental in automating production operations and developing solutions for platform scaling across multiple regions.
The position requires expert-level knowledge of Kubernetes, proficiency in Python or Go, and a strong understanding of cloud platforms, particularly AWS. The work arrangement is hybrid, requiring at least one day per week in the London office, so you can work on challenging technical problems while maintaining work-life balance.
Cisco ThousandEyes values diverse perspectives and encourages applications from candidates with varied backgrounds, emphasizing potential over traditional qualifications. The company offers a collaborative environment and a platform that helps organizations deliver seamless digital experiences.