Amazon Consumer Tier One Support (C-TOS) is seeking a System Development Engineer to join its first line of defense for maintaining high availability of the Amazon Retail Website. This role is crucial in making customer-impacting events shorter, less frequent, and less severe through large-scale event and incident management. You'll work with a globally distributed team across Austin, Dublin, and Sydney as part of a 24x7 coverage model, working four 10-hour shifts a week.
The position combines hands-on development of automation tools with incident management responsibilities. You'll build tooling to automate the detection and resolution of issues within Amazon's Retail Website infrastructure, while also leading conference calls and directing the resolution of high-visibility incidents. The role offers the opportunity to make a significant impact at scale, as the Amazon Retail Website serves hundreds of millions of customers globally.
As part of the team, you'll work on projects to expand tooling usage across Amazon, analyze incident data to drive process improvements, and help make future events less severe or entirely preventable. The team is growing rapidly and expanding its offerings globally, making it an exciting time to join.
The ideal candidate has a strong background in infrastructure automation, experience with modern programming languages, and familiarity with Linux/Unix environments. Knowledge of CI/CD pipelines and experience with distributed systems at scale are highly valued. This role offers excellent growth potential and the opportunity to make a substantial impact on the reliability of Amazon's global retail platform.
Working at Amazon, you'll be part of a company known for its innovation, customer-centric approach, and robust technical infrastructure. The role offers the chance to work with cutting-edge technologies while solving complex problems that affect millions of customers worldwide.