AWS Neuron team is seeking a Senior Machine Learning Compiler Engineer to join their innovative team working on ML accelerators at the forefront of AWS innovation. The role focuses on developing a cutting-edge compiler stack for deep learning frameworks like TensorFlow, PyTorch, and MXNET. The position involves working with AWS Machine Learning accelerators, including the Inferentia chip, which delivers best-in-class ML inference performance at the lowest cost in cloud, and Trainium, designed for ML training performance.
The successful candidate will be part of Amazon Annapurna Labs team, responsible for silicon development at AWS. They will work on optimizing complex neural net models on custom-built AWS hardware, collaborating with some of the brightest minds in engineering, research, and product communities. The role offers opportunities to architect and implement critical features, publish research, and mentor experienced engineers.
The position offers excellent work-life balance, with flexibility in working hours and a strong emphasis on personal and professional development. The team culture promotes inclusion, knowledge sharing, and mentorship, with opportunities for career growth through challenging projects and continuous learning. The role requires expertise in compiler development, machine learning, and team leadership, with opportunities to work on groundbreaking technology used by major customers like Snap, Autodesk, Amazon Alexa, and Amazon Rekognition.
This is an exceptional opportunity for a senior engineer passionate about machine learning, compiler optimization, and hardware acceleration to make a significant impact in the field of AI infrastructure. The role combines technical leadership with hands-on development, requiring both deep technical expertise and strong communication skills to collaborate with various AWS teams and contribute to the next generation of ML computing solutions.