System Development Engineer, AWS Incident Response

Amazon Web Services (AWS) is the world's most comprehensive and broadly adopted cloud platform, pioneering cloud computing and continuous innovation.
DevOps
Mid-Level Software Engineer
In-Person
5,000+ Employees
3+ years of experience
Enterprise SaaS · Cloud

Description For System Development Engineer, AWS Incident Response

AWS Infrastructure Services is seeking a System Development Engineer to join their AWS Incident Response team. This role is crucial in maintaining AWS's global infrastructure and ensuring cloud service reliability. You'll be part of the team that keeps the cloud running, supporting AWS data centers, servers, storage, and networking equipment.

As a System Development Engineer, you'll build automation tools for detecting and resolving infrastructure issues, lead high-visibility incident resolution, and drive continuous improvement based on incident learnings. You'll work with distributed systems at scale, participating in an on-call rotation that includes weekends and holidays.

The role combines hands-on technical work with incident management leadership. You'll use programming languages like Python, Ruby, Go, or Java to create tools that improve AWS's infrastructure reliability. The position offers significant growth potential within AWS's supportive team environment, with formal mentorship programs and opportunities to learn from experienced colleagues.

Key aspects of the role include:

  • Building and enhancing incident detection and management tools
  • Leading conference calls and remote teams during major incidents
  • Implementing automation to prevent or minimize future incidents
  • Participating in Agile development processes
  • Creating technical documentation and procedures
  • Mentoring team members

AWS offers comprehensive benefits, values work-life harmony, and maintains an inclusive culture that welcomes diverse perspectives. The company provides extensive resources for career development and knowledge-sharing, making it an ideal environment for engineers who want to make a significant impact while growing their careers.

This position is based in Sydney, Australia, and offers the opportunity to work with cutting-edge technology while solving complex infrastructure challenges at a global scale.

Last updated 2 minutes ago

Responsibilities For System Development Engineer, AWS Incident Response

  • Drive the resolution of large scale customer impacting issues as part of a team rotation, including some weekends and holidays
  • Design, build, and enhance incident detection and management tools
  • Participate in Agile sprints to evolve business processes and technologies
  • Create and review documentation; design new standard operating procedures
  • Identify and troubleshoot recurring platform issues and own projects to drive improvements
  • Mentor peers in your areas of technical and operational strength

Requirements For System Development Engineer, AWS Incident Response

Python
Ruby
Go
Java
Linux
  • Experience in automating, deploying, and supporting large-scale infrastructure
  • Experience programming with at least one modern language such as Python, Ruby, Golang, Java, C++, C#, Rust
  • Experience with Linux/Unix
  • Experience with CI/CD pipelines build processes

Benefits For System Development Engineer, AWS Incident Response

Medical Insurance
Dental Insurance
Vision Insurance
Parental Leave
  • Comprehensive medical, dental, and vision insurance
  • Parental leave
  • Work-life harmony focus
  • Career development resources
  • Mentorship programs

Interested in this job?

Jobs Related To Amazon System Development Engineer, AWS Incident Response

System Dev Engineer, Engineering & IT

System Dev Engineer role at Amazon supporting fulfillment technology systems, requiring 4+ years of software development experience and strong systems engineering knowledge.

System Development Engineer II

System Development Engineer II position at Amazon in Boston, MA, focusing on infrastructure automation, system reliability, and technical leadership with competitive compensation.

Systems Engineer, MES, Robotics IT

Systems Engineer position at Amazon Robotics leading MES implementation and support for manufacturing operations, requiring 2+ years of manufacturing systems experience.

Server Engineer | Data Center Operations, NRTE - DCO

Server Engineer position at Amazon Data Services Japan, focusing on data center operations and infrastructure management with responsibilities in hardware maintenance and system troubleshooting.

System Development Engineer, AFT - Platform Engineering & Services

System Development Engineer role at Amazon Fulfillment Technologies, focusing on developing flow control architecture for fulfillment centers with competitive compensation and benefits.