Senior Compute Site Reliability Engineer (GPU)

Apple

A leading technology company that designs, develops, and sells consumer electronics, software, and services.

Seattle, WA, USA

$135,400 - $250,600

DevOps

Staff Software Engineer

In-Person

5,000+ Employees

5+ years of experience

AI · Enterprise SaaS

Description For Senior Compute Site Reliability Engineer (GPU)

At Apple, we're looking for a Senior Compute Site Reliability Engineer (GPU) to join our Software and Services team. This role combines traditional SRE responsibilities with specialized focus on GPU infrastructure, making it perfect for engineers passionate about high-performance computing and reliability.

As an SRE, you'll be at the forefront of maintaining and scaling Apple's GPU-accelerated cloud infrastructure, supporting thousands of development and operations engineers. You'll work with cutting-edge technologies including GPU-based virtual machines, Kubernetes clusters, and modern monitoring tools to ensure our services run efficiently and reliably.

The ideal candidate brings 5+ years of SRE/DevOps experience, with deep knowledge of GPU infrastructure and cloud platforms. You'll need strong skills in implementing and managing GPU-accelerated environments, along with expertise in modern DevOps practices and tools. Your role will be crucial in supporting mission-critical cloud systems, ensuring they maintain constant uptime and scale seamlessly.

What makes this role unique is the opportunity to work at the intersection of high-performance computing and site reliability engineering at Apple's scale. You'll collaborate with data scientists, developers, and various stakeholders, making significant impacts on Apple's infrastructure while working with some of the most advanced GPU computing systems.

Benefits include competitive base pay ($135,400-$250,600), equity opportunities through stock programs, comprehensive healthcare, retirement benefits, and education reimbursement. Join us in shaping the future of GPU-accelerated cloud infrastructure at one of the world's most innovative companies.

Last updated 2 days ago

Responsibilities For Senior Compute Site Reliability Engineer (GPU)

Design and deploy GPU-accelerated VM and container infrastructure using platforms such as KVM, Qemu, AWS, or Google Cloud
Implement GPU-based Kubernetes clusters to support containerized applications and services
Work with data scientists, developers, and other stakeholders to understand requirements
Implement best practices for security, scalability, and high availability environments
Monitor and optimize resource utilization to ensure performance and cost-efficiency
Actively participate in capacity planning, scale testing, and disaster recovery exercises
Troubleshoot issues across the entire infrastructure stack
Cultivate and maintain relationships with internal and external third-party vendors

Requirements For Senior Compute Site Reliability Engineer (GPU)

Kubernetes

5+ years in a Site Reliability Engineering, DevOps, or Infrastructure focused role
Proven experience with GPU-based virtual machine infrastructure and cloud platforms (e.g., AWS, GCP)
Experience with GPU hardware (e.g., NVIDIA, AMD) and associated software stack (e.g., CUDA, cuDNN)
Experience with GitOps, CI/CD tools, and deployment strategies like Spinnaker, Argo
Ability to implement and coordinate telemetry using monitoring and observability tools
Outstanding organizational and communications skills
BS/MS degree (Engineering or Computer Science) or equivalent work experience

Benefits For Senior Compute Site Reliability Engineer (GPU)

Medical Insurance

Dental Insurance

Vision Insurance

401k

Equity

Education Budget

Relocation Benefits

Comprehensive medical and dental coverage
Retirement benefits
Employee stock programs
Education reimbursement
Discretionary bonuses
Relocation assistance

Apple

A leading technology company that designs, develops, and sells consumer electronics, software, and services.

Seattle, WA, USA

$135,400 - $250,600

DevOps

Staff Software Engineer

In-Person

5,000+ Employees

5+ years of experience

AI · Enterprise SaaS

Interested in this job?

Jobs Related To Apple Senior Compute Site Reliability Engineer (GPU)

System Infrastructure Developer

Apple

Senior infrastructure development role at Apple focusing on silicon technology and CAD automation systems.

Senior DevOps Engineer

Apple

Senior DevOps Engineer role at Apple, focusing on infrastructure automation, cloud platforms, and operational excellence, offering competitive compensation and comprehensive benefits.

Sr Software Engineer - Infrastructure and operations

Apple

Senior Software Engineer role at Apple focusing on infrastructure and operations for ML Systems Evaluation Engineering team.

Software Engineer (Tools), Engagement Engineering

Apple

Senior Software Engineer role at Apple focusing on building developer tools and infrastructure for Apple's platforms, offering competitive compensation and benefits.

Software Engineer: DevOps/Automation Engineer

Apple

DevOps/Automation Engineer role at Apple, focusing on release tooling and infrastructure for silicon validation team. 10+ years experience required. Salary range: $175,800-$312,200.