Taro Logo

Software Engineer (Site Reliability), Retail Engineering

Global technology company that designs, develops, and sells consumer electronics, software, and services.
Site Reliability
Senior Software Engineer
In-Person
5,000+ Employees
4+ years of experience
Enterprise SaaS · Retail
This job posting may no longer be active. You may be interested in these related jobs instead:

Description For Software Engineer (Site Reliability), Retail Engineering

Carrier Services at Apple offers seamless integration of Apple Retail Stores and Apple Online store with major US Carriers for iPhone activations. We are seeking a talented Site Reliability Engineer to join our growing team. This role focuses on ensuring the reliability, scalability, and performance of our critical systems and services.

The ideal candidate will have extensive hands-on experience working as an SRE engineer for large-scale, customer-facing Cloud applications. You'll need a strong understanding of SRE principles, including monitoring, alerting, error budgets, and fault analysis. The role requires excellent troubleshooting and problem-solving skills, with the ability to represent the SRE organization in design reviews and operational readiness exercises.

You'll work with both technical and non-technical teams, analyzing statistics to maintain system health. Knowledge of Oracle and Cassandra databases is valuable. We're looking for someone passionate about automation, with strong networking and load balancing expertise. The role involves leading a small team, making business-critical decisions, and thriving in a dynamic environment.

Key responsibilities include proactive handling of critical production issues, participating in on-call rotations, and implementing robust monitoring solutions. You'll be part of a team that ensures the seamless operation of Apple's retail technology infrastructure, directly impacting millions of customers' experiences.

This is an excellent opportunity for an experienced SRE to make a significant impact at one of the world's most innovative technology companies, working on systems that power Apple's retail operations globally.

Last updated 4 months ago

Responsibilities For Software Engineer (Site Reliability), Retail Engineering

  • Ensure reliability, scalability, and performance of systems and services
  • Work with engineering and operations teams to design, build, and maintain infrastructure
  • Participate in design reviews and operational readiness exercises
  • Collaborate with technical and non-technical teams to analyze system statistics
  • Lead incident management and resolution for production issues
  • Participate in on-call rotation
  • Implement and maintain monitoring and alerting systems
  • Automate manual operations and improve through iteration

Requirements For Software Engineer (Site Reliability), Retail Engineering

Python
Cassandra
Linux
PostgreSQL
  • 4 years of experience in incident management for large-scale applications
  • 4 years of strong troubleshooting and debugging skills
  • 4 years of experience with observability tools like Splunk and Prometheus
  • 4 years of proficiency in Python or similar scripting languages
  • BS in Computer Science or equivalent work experience
  • Experience with Oracle and Cassandra databases
  • Strong problem solving and communication skills
  • Willingness to participate in on-call rotations

Benefits For Software Engineer (Site Reliability), Retail Engineering

Medical Insurance
Dental Insurance
Vision Insurance
  • Equal opportunity employer
  • Reasonable accommodation for disabilities
  • Comprehensive benefits package