Join Amazon's innovative Annapurna Labs team as a Senior Machine Learning Engineer working on AWS Neuron, the complete software stack for AWS Trainium and Inferentia cloud-scale ML accelerators. This role focuses on distributed training development for cutting-edge ML models including Large Language Models (LLM), Stable Diffusion, and Vision Transformers. You'll collaborate with chip architects and software engineers to optimize performance on custom AWS silicon.
The position offers an excellent opportunity to work at the intersection of machine learning and hardware acceleration, developing solutions that push the boundaries of what's possible in AI training at scale. You'll be part of a team that values knowledge-sharing, mentorship, and career growth, working in an inclusive environment that celebrates diverse experiences and perspectives.
As part of AWS, you'll be contributing to the world's most comprehensive cloud platform, helping to pioneer new innovations in cloud computing. The role offers competitive compensation ranging from $151,300 to $261,500 based on location, plus equity and comprehensive benefits. The team maintains a strong focus on work-life harmony and provides extensive support for professional development.
Key technologies you'll work with include PyTorch, JAX, XLA, FSDP, Deepspeed, and Nemo, while developing solutions for AWS's custom ML accelerators like Trainium and Inferentia. This is an opportunity to make a significant impact on the future of machine learning infrastructure while working with some of the most advanced AI hardware and software systems available.