AWS Incident Response is at the heart of the high availability of Amazon Web Services. We make customer-impacting events shorter and less frequent by providing large-scale event and incident management. Our automated tooling quickly identifies the cause of an issue and helps mitigate its impact, and much of our engineers' time is spent on projects to improve that tooling and automation. We also provide manual incident management for AWS and other Amazon groups, directing the resolution of an issue with service teams and diving deep into those events to drive improvements to the tooling.
As a Support Engineer on the team, you will:
The AWS Incident Response (AIR) team is Amazon's central defense against large-scale incidents and drives operational excellence across all of Amazon's businesses. Our engineers are front and center in driving down event duration through experience in operational excellence, current best practices, and incident management tooling.
Join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital roles. You'll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers.