Annapurna Labs, a crucial part of AWS, is seeking an experienced engineer to work on distributed AI/ML systems. This role focuses on developing collective operations that enable AI to scale across multiple accelerators & servers. The position involves working with C/C++ in a low-level environment, requiring solid knowledge of Linux, kernels, and performance optimization.
The team is at the forefront of AI/ML development, working on features for the largest clusters and AI models. You'll be part of a diverse, international workforce, collaborating with infrastructure experts, hardware engineers, RTL engineers, scientists & architects. The organization values mentorship, both receiving and providing guidance to team members.
Key Responsibilities:
The role offers:
Required Qualifications:
The position is ideal for candidates passionate about solving complex problems in AI/ML infrastructure, with a focus on performance optimization and scalability. You'll be working in an environment that values innovation, mentorship, and professional growth while contributing to cutting-edge technology that powers AWS services.