Join the innovative team at Annapurna Labs (AWS) as an EFA Network Software Engineer, where you'll be at the forefront of enabling next-generation Machine Learning in the cloud. Our team owns the critical user-space software powering the Elastic Fabric Adapter (EFA) network card, essential for Machine Learning and High-Performance Computing customers on AWS.
In this role, you'll work on multiple C-based projects that enable networking for thousands of GPU and CPU instances, handling the most demanding clustered workloads. You'll be writing high-performance code for open-source projects like Libfabric and Open MPI, collaborating with various teams to innovate new cloud networking APIs, and providing expert support to leading AI companies worldwide.
Your day-to-day responsibilities will include optimizing software stack performance, creating detailed technical designs, implementing comprehensive testing strategies, and working closely with the ML Infrastructure team to ensure optimal performance across large-scale machine clusters. The position offers competitive compensation ranging from $129,300 to $223,600 per year, depending on location and experience.
We're a dynamic, fast-paced team within AWS that stays ahead of AI industry trends, ensuring our products are ready for whatever challenges come next. We emphasize automation and focus on solving the most interesting problems as our customers push the boundaries of what our network can achieve. This is an excellent opportunity for growth, as we invest heavily in career development and supporting team members in reaching their highest potential.
The ideal candidate will bring 3+ years of professional software development experience, strong expertise in C programming, and a deep understanding of high-performance computing and networking concepts. If you're passionate about building cutting-edge networking solutions and want to impact the future of cloud computing and AI infrastructure, this role offers an exciting opportunity to work with the latest technologies while solving complex technical challenges.