The Apple Service Engineering - SRE team is seeking experienced Site Reliability Engineers to develop and maintain large-scale distributed systems. As part of this role, you'll work on building next-generation search infrastructure and platform services, collaborating with various ASE teams from store and commerce to search and recommendations.
The position focuses on developing applications and tooling that are safe, reliable, scalable, and fast. You'll be responsible for managing Voldemort key-value distributed database infrastructure deployment on both on-premise bare metal and public cloud platforms. This includes handling maintenance, deployment automation, backup, observability, and telemetry, with a strong emphasis on reliability, performance, and scaling.
Your role will involve working with cutting-edge technologies and implementing both open source and home-grown solutions to provide managed data infrastructure services. You'll be part of a team that ensures Apple's services remain reliable, scalable, and secure, while delivering continuous data store availability to ASE Media Applications.
The ideal candidate should be comfortable questioning assumptions, working effectively under tight deadlines, and developing elegant technical solutions to complex problems. You'll have the opportunity to collaborate with engineers across Apple, define metrics, set targets, and uncover optimization opportunities that will directly impact customer experience.
This position offers significant growth potential and the chance to work on large cross-organizational projects. Your contributions will help shape the future of Apple's service infrastructure, and your innovative ideas will be valued and rewarded. The role provides competitive compensation, comprehensive benefits, and the opportunity to work with some of the best minds in technology while delivering services that millions of customers rely on daily.