Rackspace Technology is seeking a Senior Site Reliability Engineer to join their Professional Services Center of Excellence focusing on Application Performance Monitoring Suites. This role combines modern SRE practices with observability using tools like Datadog, New Relic, AppDynamics, and Dynatrace to create exceptional customer experiences.
As an SRE at Rackspace, you'll be at the intersection of application performance, user experience, and business outcomes. You'll work with cutting-edge observability tools to help customers understand and optimize their applications. The role involves implementing sophisticated monitoring solutions, building scalable systems, and maintaining robust automation to support engineering goals.
The ideal candidate brings 3+ years of extensive experience in cloud infrastructure (AWS EKS, Azure AKS), Kubernetes, and observability tools. You'll need strong expertise in Kafka for large-scale environments, security operations, and disaster recovery strategies. The position requires proficiency in Python, Go, and bash scripting, along with deep knowledge of monitoring tools like Prometheus, Grafana, and Datadog.
Rackspace offers a collaborative environment where you'll work with development teams to implement new features while ensuring reliability and performance standards. The company has been consistently recognized as a best place to work by Fortune, Forbes, and Glassdoor, offering an inclusive culture that values diverse perspectives and innovative thinking.
This remote position offers the opportunity to shape the future of observability engineering while working with a leading multicloud solutions provider. You'll be part of a team that embraces technology and empowers customers to accelerate their digital transformation journey.