Join AWS's Machine Learning Infrastructure team at Annapurna Labs, where innovation meets scale in cloud computing. As a Senior Software Development Engineer, you'll lead the development of critical infrastructure that powers AWS's ML and High Performance Computing technologies. This role combines deep technical expertise with leadership opportunities, focusing on building and maintaining sophisticated monitoring and automation systems that ensure peak performance of AWS ML technologies.
The position offers a unique opportunity to work with cutting-edge technologies including AWS Trainium, Graviton, and Elastic Fabric Adapter (EFA). You'll be responsible for developing infrastructure that handles massive testing workloads, creating efficient automation systems, and building comprehensive monitoring solutions using advanced tools like AWS Managed Grafana and Athena.
Your work will directly impact the efficiency and reliability of AWS's ML and HPC offerings, as you develop solutions that help teams deliver better software faster. The role requires expertise in Python, TypeScript, and Linux, combined with strong experience in CI/CD pipelines and cluster management. You'll work in a collaborative environment where innovation is encouraged, and your ideas can shape the future of cloud computing.
AWS offers competitive compensation, comprehensive benefits, and a culture that values work-life harmony. You'll be part of a diverse, inclusive team that embraces continuous learning and professional growth. The position provides opportunities for mentorship, both giving and receiving, and allows you to work on projects that directly influence how customers implement ML and HPC workloads in the cloud.
If you're passionate about building scalable infrastructure, automating complex systems, and working with cutting-edge ML technologies, this role offers the perfect blend of technical challenge and career growth. Join us in making AWS the most efficient and cost-effective platform for AI at scale.