Do you want to be part of AI revolution? At AWS our vision is to make deep learning pervasive for everyday developers and to democratize access to cutting-edge infrastructure. AWS Neuron is the SDK that optimizes the performance of complex ML models executed on AWS Inferentia and Trainium, our custom chips designed to accelerate deep-learning workloads.
As a Senior Machine Learning Compiler Engineer in the AWS Neuron team, you will be responsible for building next generation Neuron compiler which transforms ML models written in ML frameworks (e.g., PyTorch, TensorFlow, and JAX) to be deployed on AWS Inferentia and Trainium based servers in the Amazon cloud.
Your role will involve solving complex compiler optimization problems to achieve optimum performance for various ML model families, including massive scale large language models like Llama, Deepseek, and beyond, as well as stable diffusion and vision transformers. You'll need to understand these models inside-out to make informed decisions on compiler optimizations.
Key Responsibilities:
The team operates in a startup-like environment, focusing on high-impact projects. We value knowledge-sharing and mentorship, with senior members providing one-on-one guidance and thorough code reviews. Career growth is emphasized through challenging projects and continuous learning opportunities.
AWS offers comprehensive benefits including medical insurance, 401k, parental leave, and more. We embrace diversity through employee-led affinity groups and foster an inclusive culture that celebrates our differences.
This is an opportunity to work at the forefront of AI infrastructure, helping shape the future of machine learning acceleration while being part of AWS's innovative culture. Join us in democratizing access to cutting-edge AI infrastructure and making deep learning accessible to developers worldwide.
Required Qualifications:
Preferred Qualifications: