AWS Neuron is seeking a Software Development Engineer II to join their Machine Learning Applications team, focusing on distributed training solutions. This role is part of Annapurna Labs, AWS's infrastructure provider, which was acquired in 2015. The position involves working on AWS Neuron, the complete software stack for AWS Inferentia and Trainium cloud-scale machine learning accelerators.
The role combines deep technical expertise in machine learning with software engineering, requiring work on massive scale language models like GPT2/3 and vision transformers. You'll collaborate with chip architects and compiler engineers to optimize performance on AWS's custom silicon. The team emphasizes work-life balance and provides strong mentorship opportunities.
This is an exciting opportunity to work at the intersection of machine learning and distributed systems, developing solutions that will impact millions of users worldwide. The position offers competitive compensation ($129,300-$223,600 based on location) and comprehensive benefits. You'll be part of AWS's inclusive culture, with access to employee-led affinity groups and ongoing learning experiences.
The role requires 3+ years of software development experience, strong ML knowledge, and expertise in distributed training frameworks. You'll be responsible for building and optimizing ML solutions using PyTorch, TensorFlow, and AWS's custom hardware accelerators. This is a chance to work on cutting-edge ML infrastructure while being part of a team that values knowledge sharing and professional growth.