At Apple, we're looking for a Senior Compute Site Reliability Engineer (GPU) to join our Software and Services team. This role combines traditional SRE responsibilities with specialized focus on GPU infrastructure, making it perfect for engineers passionate about high-performance computing and reliability.
As an SRE, you'll be at the forefront of maintaining and scaling Apple's GPU-accelerated cloud infrastructure, supporting thousands of development and operations engineers. You'll work with cutting-edge technologies including GPU-based virtual machines, Kubernetes clusters, and modern monitoring tools to ensure our services run efficiently and reliably.
The ideal candidate brings 5+ years of SRE/DevOps experience, with deep knowledge of GPU infrastructure and cloud platforms. You'll need strong skills in implementing and managing GPU-accelerated environments, along with expertise in modern DevOps practices and tools. Your role will be crucial in supporting mission-critical cloud systems, ensuring they maintain constant uptime and scale seamlessly.
What makes this role unique is the opportunity to work at the intersection of high-performance computing and site reliability engineering at Apple's scale. You'll collaborate with data scientists, developers, and various stakeholders, making significant impacts on Apple's infrastructure while working with some of the most advanced GPU computing systems.
Benefits include competitive base pay ($135,400-$250,600), equity opportunities through stock programs, comprehensive healthcare, retirement benefits, and education reimbursement. Join us in shaping the future of GPU-accelerated cloud infrastructure at one of the world's most innovative companies.