AWS Infrastructure Services is at the heart of Amazon's cloud operations, responsible for the design, planning, delivery, and operation of all AWS global infrastructure. The AWS Network Alerts team is seeking Software Development Engineers to build highly scalable monitoring systems for one of the world's largest and most complex networks. This role involves working on large-scale distributed systems that process trillions of events per hour, using anomaly detection and rule engines to proactively detect and remediate network impairments.
As an SDE II on the team, you'll be part of a world-class software development team operating like a startup within AWS. You'll build next-generation cloud monitoring solutions that ensure the reliability and fault-tolerance of AWS's network infrastructure. The role offers the opportunity to work with cutting-edge technologies and solve complex challenges in network monitoring and observability.
The position involves collaborating with senior, principal, and distinguished engineers, working on systems that process massive amounts of data, and developing intelligent monitoring solutions that keep AWS services running reliably worldwide. You'll be responsible for building and maintaining critical services that process trillions of events per hour, while ensuring high availability and scalability.
The ideal candidate should have strong experience in distributed systems, a passion for solving complex technical challenges, and the ability to work effectively in an agile environment. This role offers significant growth opportunities, working alongside some of the best minds in the industry while contributing to systems that power AWS's global infrastructure.
Key technologies and concepts you'll work with include anomaly detection systems, machine learning-based monitoring, distributed systems, and large-scale data processing. The role combines technical depth with business impact, as your work will directly contribute to maintaining AWS's network reliability and customer satisfaction.