Incident Management Engineer

Amazon is the earth's most customer-centric company, leading the world in cloud technologies through Amazon Web Services (AWS).
Cape Town, South AfricaWaterfall, 3610, South Africa
Cloud
Senior Software Engineer
In-Person
5,000+ Employees
3+ years of experience
Enterprise SaaS · Cloud
This job posting may no longer be active. You may be interested in these related jobs instead:
Technical Account Manager, Telco Vertical

Senior Technical Account Manager position at AWS, focusing on cloud services and customer success, requiring technical expertise and customer-facing skills.

Data Centre Chief Engineer, InfraOps Cluster Operations

Senior technical role overseeing AWS data centre operations, maintaining critical infrastructure, and leading engineering teams in the Thames Valley region.

Senior Software Engineer, IaC Provider Experience

Senior Software Engineer position at Amazon Web Services focusing on Infrastructure as Code and cloud services development.

Commissioning Engineer, AMER-Central ACx

Senior Data Center Commissioning Engineer role at AWS, overseeing infrastructure testing and validation with 5+ years experience required.

Systems Development Engineer, Amazon Elastic VMware Service(EVS)

Senior Systems Development Engineer role at Amazon working on VMware cloud infrastructure, offering competitive pay and benefits in San Francisco Bay Area.

Description For Incident Management Engineer

Amazon Web Services (AWS) is looking for an Incident Management Engineer to join their Enhanced Support Services (ES2) team. This role is part of the AWS Support organization and is dedicated to managing critical escalations, customer-facing communications, and handling large-scale customer impacting events. The ideal candidate will have a broad skill set, strong analytical acumen, solid technology experience, and excellent communication skills.

Key responsibilities include:

  • Driving the resolution of large-scale customer impacting incidents
  • Managing critical and complex customer escalations
  • Providing incident response and management for critical workloads
  • Contributing to Problem Records for customers
  • Conducting real-time proactive monitoring of customer metrics
  • Collaborating with stakeholders to improve customer experience
  • Leading projects and remote teams to drive operational improvements
  • Mentoring peers in technical and operational areas

The role requires:

  • 2+ years of demonstrable Major Incident / Problem Manager Experience
  • 1+ years of experience in Support Engineering, Network Engineering, Solutions Architecture, or similar IT role
  • Bachelor's degree in a related field or 3+ years of relevant work experience
  • Familiarity with Cloud services, focusing on high availability and fault-tolerant design
  • Experience with data manipulation and/or automation using Python, JavaScript, or shell scripting
  • Ability to work in ambiguous environments and drive collaborative projects

AWS values diversity and work-life harmony, offering a supportive and inclusive work environment with opportunities for mentorship and career growth. The core business hours are from 8am-5pm SAST, and the position is available in Cape Town and Waterfall City, South Africa.

Last updated a month ago

Responsibilities For Incident Management Engineer

  • Drive resolution of large-scale customer impacting incidents
  • Manage critical and complex customer escalations
  • Provide incident response and management for critical workloads
  • Contribute to Problem Records for customers
  • Conduct real-time proactive monitoring of customer metrics
  • Collaborate with stakeholders to improve customer experience
  • Lead projects and remote teams to drive operational improvements
  • Mentor peers in technical and operational areas

Requirements For Incident Management Engineer

Python
JavaScript
Linux
  • 2+ years of Major Incident / Problem Manager Experience
  • 1+ years in Support Engineering, Network Engineering, Solutions Architecture, or similar IT role
  • Bachelor's degree in related field or 3+ years relevant work experience
  • Familiarity with Cloud services
  • Experience with data manipulation and automation
  • Ability to work in ambiguous environments

Benefits For Incident Management Engineer

  • Career Growth
  • Mentorship
  • Inclusive Team Culture
  • Work-Life Balance

Interested in this job?