Taro Logo

Site Reliability Engineer, IS&T Ai & Data Platforms

Apple is where individual imaginations gather together, committing to the values that lead to great work.
$147,400 - $220,900
Site Reliability
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS

Job Description

Apple's Artificial Intelligence and Data Platforms (AiDP) team is seeking a Site Reliability Engineer to build and maintain high quality, scalable and resilient distributed systems that power Apple's enterprise solutions and data pipelines. This role focuses on developing, operating, and maintaining infrastructure for diverse solutions including web applications, data pipelines, and batch operations. The position emphasizes automation and self-service capabilities to efficiently manage large volumes of containers and virtual machines with minimal human intervention. The role requires expertise in Kubernetes, observability tools, and CI/CD pipelines, working within both cloud and on-premises environments. The ideal candidate will collaborate across teams to ensure robust system design and operation while maintaining high standards of reliability and performance. This is an opportunity to work on infrastructure problems at scale, implementing best practices and driving innovation in one of the world's leading technology companies. The role offers competitive compensation, comprehensive benefits, and the chance to contribute to systems that support critical business functions across Sales, Operations, Finance, AppleCare, Marketing and Internet Services.

Last updated 2 days ago

Responsibilities For Site Reliability Engineer, IS&T Ai & Data Platforms

  • Implement and manage Kubernetes clusters at scale across cloud and on-premises environments
  • Develop and maintain observability capabilities for various application profiles
  • Construct CI/CD pipelines for efficient code deployment
  • Prepare alert handling procedures and run-books
  • Participate in production environment troubleshooting
  • Collaborate with Software Development, Business stakeholders, and other teams
  • Provide design and architectural input for new solutions

Requirements For Site Reliability Engineer, IS&T Ai & Data Platforms

Kubernetes
Python
Java
  • Proven experience in creating, managing, and operating Kubernetes clusters at scale
  • Proficiency in Helm or Kustomize for Kubernetes applications
  • Experience in GitOps-based deployment tools (Spinnaker, Flux, ArgoCD)
  • Knowledge of infrastructure templating tools like CloudFormation and Terraform
  • Bachelor's degree in Computer Science or equivalent experience

Benefits For Site Reliability Engineer, IS&T Ai & Data Platforms

401k
Medical Insurance
Dental Insurance
Vision Insurance
Equity
Education Budget
Relocation Benefits
  • Comprehensive medical and dental coverage
  • Retirement benefits
  • Employee stock programs
  • Education reimbursement
  • Discretionary bonuses
  • Relocation assistance