Senior Compute Site Reliability Engineer (GPU)

A leading technology company that designs, develops, and sells consumer electronics, software, and services.
$135,400 - $250,600
DevOps
Staff Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS

Description For Senior Compute Site Reliability Engineer (GPU)

At Apple, we're looking for a Senior Compute Site Reliability Engineer (GPU) to join our Software and Services team. This role combines traditional SRE responsibilities with specialized focus on GPU infrastructure, making it perfect for engineers passionate about high-performance computing and reliability.

As an SRE, you'll be at the forefront of maintaining and scaling Apple's GPU-accelerated cloud infrastructure, supporting thousands of development and operations engineers. You'll work with cutting-edge technologies including GPU-based virtual machines, Kubernetes clusters, and modern monitoring tools to ensure our services run efficiently and reliably.

The ideal candidate brings 5+ years of SRE/DevOps experience, with deep knowledge of GPU infrastructure and cloud platforms. You'll need strong skills in implementing and managing GPU-accelerated environments, along with expertise in modern DevOps practices and tools. Your role will be crucial in supporting mission-critical cloud systems, ensuring they maintain constant uptime and scale seamlessly.

What makes this role unique is the opportunity to work at the intersection of high-performance computing and site reliability engineering at Apple's scale. You'll collaborate with data scientists, developers, and various stakeholders, making significant impacts on Apple's infrastructure while working with some of the most advanced GPU computing systems.

Benefits include competitive base pay ($135,400-$250,600), equity opportunities through stock programs, comprehensive healthcare, retirement benefits, and education reimbursement. Join us in shaping the future of GPU-accelerated cloud infrastructure at one of the world's most innovative companies.

Last updated 2 days ago

Responsibilities For Senior Compute Site Reliability Engineer (GPU)

  • Design and deploy GPU-accelerated VM and container infrastructure using platforms such as KVM, Qemu, AWS, or Google Cloud
  • Implement GPU-based Kubernetes clusters to support containerized applications and services
  • Work with data scientists, developers, and other stakeholders to understand requirements
  • Implement best practices for security, scalability, and high availability environments
  • Monitor and optimize resource utilization to ensure performance and cost-efficiency
  • Actively participate in capacity planning, scale testing, and disaster recovery exercises
  • Troubleshoot issues across the entire infrastructure stack
  • Cultivate and maintain relationships with internal and external third-party vendors

Requirements For Senior Compute Site Reliability Engineer (GPU)

Kubernetes
Go
  • 5+ years in a Site Reliability Engineering, DevOps, or Infrastructure focused role
  • Proven experience with GPU-based virtual machine infrastructure and cloud platforms (e.g., AWS, GCP)
  • Experience with GPU hardware (e.g., NVIDIA, AMD) and associated software stack (e.g., CUDA, cuDNN)
  • Experience with GitOps, CI/CD tools, and deployment strategies like Spinnaker, Argo
  • Ability to implement and coordinate telemetry using monitoring and observability tools
  • Outstanding organizational and communications skills
  • BS/MS degree (Engineering or Computer Science) or equivalent work experience

Benefits For Senior Compute Site Reliability Engineer (GPU)

Medical Insurance
Dental Insurance
Vision Insurance
401k
Equity
Education Budget
Relocation Benefits
  • Comprehensive medical and dental coverage
  • Retirement benefits
  • Employee stock programs
  • Education reimbursement
  • Discretionary bonuses
  • Relocation assistance

Interested in this job?

Jobs Related To Apple Senior Compute Site Reliability Engineer (GPU)

System Infrastructure Developer

Senior infrastructure development role at Apple focusing on silicon technology and CAD automation systems.

Senior DevOps Engineer

Senior DevOps Engineer role at Apple, focusing on infrastructure automation, cloud platforms, and operational excellence, offering competitive compensation and comprehensive benefits.

Sr Software Engineer - Infrastructure and operations

Senior Software Engineer role at Apple focusing on infrastructure and operations for ML Systems Evaluation Engineering team.

Software Engineer (Tools), Engagement Engineering

Senior Software Engineer role at Apple focusing on building developer tools and infrastructure for Apple's platforms, offering competitive compensation and benefits.

Software Engineer: DevOps/Automation Engineer

DevOps/Automation Engineer role at Apple, focusing on release tooling and infrastructure for silicon validation team. 10+ years experience required. Salary range: $175,800-$312,200.