AWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud-scale machine learning accelerators and the Trn1 and Inf1 servers. This role is for a senior software engineer in the Machine Learning Applications (ML Apps) team, focusing on development, enablement and performance tuning of various ML model families including large language models.
The position involves:
The team offers:
This is a key role combining software development expertise with machine learning knowledge, perfect for someone passionate about high-performance ML systems and distributed computing.