
Production Services Site Reliability Engineer

A leading technology company that designs, develops, and sells consumer electronics, software, and services.
$129,600 - $236,300
Site Reliability
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
Enterprise SaaS
This job posting may no longer be active.

Description For Production Services Site Reliability Engineer

The Production Services Site Reliability Engineer (SRE) role at Apple is a critical position within the Software Delivery organization, which forms the backbone of Apple's software release process. This role focuses on maintaining and optimizing Atlassian services used by software engineers and project managers worldwide to develop Apple's software products.

As an SRE, you'll be responsible for applying site reliability engineering practices to maintain critical services like Bitbucket, Confluence, and Jira. These tools are essential in delivering state-of-the-art operating systems, applications, and firmware to Apple customers globally. Your work will directly impact the efficiency and reliability of Apple's software development pipeline.

Key responsibilities include configuring and monitoring both on-premises and cloud-based dependencies, automating CI/CD pipelines, and maintaining high-availability environments. You'll implement comprehensive observability solutions, generate performance metrics reports, and champion best practices in change management and incident response.

The ideal candidate will have a strong background in distributed systems, with experience in managing customer-facing systems in a 24/7 environment. You should be comfortable working with container platforms, monitoring tools, and data analysis systems. The role requires excellent communication skills, as you'll be working with a global team across multiple time zones.

At Apple, you'll be part of a team that values proactive communication, continuous learning, and innovative problem-solving. The position offers competitive compensation, comprehensive benefits, and the opportunity to work on systems that impact millions of developers and, ultimately, Apple's global customer base.

This role is perfect for someone who is passionate about building and maintaining reliable, scalable systems, has strong technical expertise in SRE practices, and wants to contribute to the development of world-class software products at one of the world's most innovative technology companies.

Last updated 5 days ago

Responsibilities For Production Services Site Reliability Engineer

  • Configure and monitor on-prem and cloud-based dependencies
  • Automate continuous integration (CI) and continuous delivery (CD) pipelines
  • Maintain staging and production environments with the goal of maximizing uptime
  • Implement observability of systems for monitoring, alerting, and metrics reporting
  • Generate reports on service performance, availability, and reliability metrics
  • Champion best practices for change control management and incident response

Requirements For Production Services Site Reliability Engineer

Kubernetes
Linux
  • B.S. in Computer Science or related work experience
  • Passion for building reliable, scalable, and performant distributed systems
  • Understanding of distributed systems with respect to application, networking, and security
  • SRE or DevOps experience managing customer-facing systems in a 24/7 environment
  • Experience managing and monitoring fleets of *nix systems or container platforms
  • Excellent judgment and integrity, with the ability to make timely and sound decisions
  • Ability to anticipate the needs of others and adapt to changing conditions

Benefits For Production Services Site Reliability Engineer

401k
Medical Insurance
Dental Insurance
Vision Insurance
Education Budget
Equity
  • Comprehensive medical and dental coverage
  • Retirement benefits
  • Employee stock programs
  • Education reimbursement
  • Discretionary bonuses
  • Relocation assistance
