AWS Machine Learning accelerators are leading AWS innovation in Generative AI development. The role focuses on the Inferentia chip team, which delivers best-in-class ML inference performance at the lowest cloud cost. As a Machine Learning Compiler Engineer II on the AWS Neuron team, you'll be instrumental in developing and scaling a compiler for the world's largest ML workloads. The position involves working with cutting-edge technology, including the AWS Neuron Software Development Kit (SDK), which integrates with popular ML frameworks like PyTorch, TensorFlow, and MxNet.
The team values work-life balance and fosters an inclusive culture that embraces differences. You'll join a diverse group of experienced engineers working on ambitious goals to create revolutionary toolchain performance improvements. The role offers significant growth opportunities through mentorship and knowledge sharing, with projects assigned to help develop well-rounded professionals.
Key responsibilities include compiler optimization, pre-silicon design work, and collaboration with AWS ML services teams. The position requires strong technical communication skills and offers exposure to cutting-edge ML hardware and software development. The team supports major customers like Snap, Autodesk, Amazon Alexa, and Amazon Rekognition.
This role combines deep technical work with collaborative team environments, making it ideal for engineers passionate about ML infrastructure and compiler optimization. The position offers competitive compensation, comprehensive benefits, and the opportunity to work on technology that shapes the future of cloud ML computing.