Join Amazon's EC2 Instance Quality team, which monitors the health and availability of EC2 instances worldwide. In this role you will work on systems that operate at massive scale, processing and distributing critical infrastructure data that affects a substantial portion of the Internet's servers. The team provides essential APIs and data for customer deployments, autoscaling, and incident response.
As a Software Development Engineer, you'll be part of a team that assesses and reports the health of every EC2 instance across AWS's global infrastructure. You'll develop scalable distributed systems, build tools that improve server health monitoring, and collaborate with other AWS teams to integrate instance health data into their products.
Amazon offers a strong culture of inclusion with ten employee-led affinity groups and various learning experiences. The team values work-life balance, offering flexible working hours and supporting personal and professional growth. You'll work alongside experienced engineers in an environment that celebrates knowledge sharing and mentorship.
The role combines the technical challenges of large-scale distributed systems with direct customer impact. You'll be instrumental in ensuring the reliability of AWS's cloud infrastructure while working with cutting-edge technologies. The position offers competitive compensation, comprehensive benefits, and opportunities for career advancement at one of the world's leading cloud computing platforms.
Key responsibilities include architecting scalable solutions, developing monitoring tools, collaborating across teams, and participating in on-call rotations. The role requires strong software development skills, system design experience, and a passion for building reliable, large-scale distributed systems.