Carrier Services at Apple offers seamless integration of Apple Retail Stores and Apple Online store with major US Carriers for iPhone activations. We are seeking a talented Site Reliability Engineer to join our growing team. This role focuses on ensuring the reliability, scalability, and performance of our critical systems and services.
The ideal candidate will have extensive hands-on experience working as an SRE engineer for large-scale, customer-facing Cloud applications. You'll need a strong understanding of SRE principles, including monitoring, alerting, error budgets, and fault analysis. The role requires excellent troubleshooting and problem-solving skills, with the ability to represent the SRE organization in design reviews and operational readiness exercises.
You'll work with both technical and non-technical teams, analyzing statistics to maintain system health. Knowledge of Oracle and Cassandra databases is valuable. We're looking for someone passionate about automation, with strong networking and load balancing expertise. The role involves leading a small team, making business-critical decisions, and thriving in a dynamic environment.
Key responsibilities include proactive handling of critical production issues, participating in on-call rotations, and implementing robust monitoring solutions. You'll be part of a team that ensures the seamless operation of Apple's retail technology infrastructure, directly impacting millions of customers' experiences.
This is an excellent opportunity for an experienced SRE to make a significant impact at one of the world's most innovative technology companies, working on systems that power Apple's retail operations globally.