Site Reliability Engineer

Apple

Apple is a technology company that designs, develops, and sells consumer electronics, computer software, and online services.

Bengaluru, Karnataka, India

Site Reliability

Senior Software Engineer

In-Person

5,000+ Employees

5+ years of experience

Enterprise SaaS

This job posting may no longer be active. You may be interested in these related jobs instead:

Description For Site Reliability Engineer

Apple Services Engineering (ASE) builds and supports the systems that make many daily experiences possible. The Site Reliability Engineering (SRE) teams are responsible for the systems and services that directly support customers and their experiences. As an SRE, you will apply best practices to ensure the availability, reliability, and performance of our systems and services.

Key responsibilities include:

Engaging with product teams to understand requirements and implement resilient infrastructure solutions
Operating, monitoring, and triaging production and non-production environments
Collaborating on code, infrastructure, design reviews, and process enhancements
Evaluating and integrating new technologies
Developing automation for provisioning, configuration, deployment, and monitoring
Participating in on-call rotation
Contributing to capacity planning, scale testing, and disaster recovery exercises
Approaching operational problems with a software engineering mindset

This role offers the opportunity to work on highly available customer-facing services and contribute to the systems that power Apple's innovative products and services. You'll be part of a team that values diversity, innovation, and leaving the world better than we found it.

Last updated 9 months ago

Responsibilities For Site Reliability Engineer

Engage with product teams to understand requirements and implement resilient infrastructure solutions
Operate, monitor, and triage production and non-production environments
Collaborate on code, infrastructure, design reviews, and process enhancements
Evaluate and integrate new technologies to improve system reliability, security, and performance
Develop and implement automation for provisioning, configuration, deployment, and monitoring
Participate in on-call rotation providing hands-on technical expertise
Contribute to capacity planning, scale testing, and disaster recovery exercises
Approach operational problems with a software engineering mindset

Requirements For Site Reliability Engineer

Java

Python

Linux

Kubernetes

BS degree in computer science or equivalent field with 5+ years of experience
5+ years in an Infrastructure Ops, Site Reliability Engineering, or DevOps-focused role
Knowledge of Linux operating system principles, networking fundamentals, and systems management
Demonstrable fluency in at least one of the following languages: Java, Python, or Go
Experience managing and scaling distributed systems in a public, private, or hybrid cloud environment