Annapurna Labs, an Amazon company, is seeking a Senior Machine Learning Engineer to join its AWS Neuron Distributed Training team. This role focuses on developing and optimizing machine learning solutions for AWS's custom silicon accelerators, Trainium and Inferentia. The position involves working with cutting-edge ML technologies, including Large Language Models (LLMs) such as GPT and Llama, as well as Stable Diffusion and Vision Transformers.
The role combines deep technical expertise in machine learning with hands-on software development, requiring proficiency in distributed training frameworks such as FSDP, DeepSpeed, and NeMo. You'll work closely with cross-functional teams, including chip architects and compiler engineers, to build and optimize distributed training solutions.
AWS Neuron is the complete software stack for AWS's cloud-scale machine learning accelerators, and this position offers the opportunity to work on next-generation AI infrastructure. The team maintains a strong culture of mentorship and knowledge-sharing, with an emphasis on career growth and professional development.
As part of Amazon Web Services (AWS), the world's leading cloud platform, you'll be at the forefront of cloud computing innovation. The role offers competitive compensation, comprehensive benefits, and the chance to work on technology that powers some of the world's most successful businesses.
The ideal candidate will bring strong software development skills, deep ML expertise, and the ability to collaborate effectively across teams. This is an opportunity to shape the future of machine learning infrastructure while working with some of the most advanced AI technologies available today.