The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. This is a senior-level position within the Neuron Compiler organization, specifically in the performance team based in Toronto.
The role focuses on analyzing and optimizing system-level performance of machine learning models on AWS ML accelerators. The team conducts in-depth profiling and works across multiple layers of the technology stack - from frameworks and compilers to runtime and collectives - to meet and exceed customer requirements while maintaining a competitive edge in the market.
As a Sr. ML Compiler Engineer, you'll be working at the intersection of machine learning, high-performance computing, and distributed systems. The position involves architecting and implementing business-critical features, publishing innovative research, and mentoring experienced engineers. The team operates in a startup-like environment with small, agile teams focused on innovation and experimentation.
Key responsibilities include optimizing machine learning workloads for global customers, conducting detailed performance analysis, working directly with customers to enable and optimize their ML models, and designing compiler optimizations. You'll collaborate with cross-functional teams across compiler, runtime, and framework domains to develop innovative optimization techniques.
The role requires extensive experience in software development, compiler design, and system-level performance optimization. You should have a strong background in machine learning technologies and frameworks like PyTorch, OpenXLA, and JAX. The ideal candidate will have experience leading technical teams and mentoring other engineers.
AWS values diverse experiences and maintains an inclusive culture through employee-led affinity groups and ongoing learning experiences. The company offers work-life harmony and various career advancement resources, including mentorship opportunities and knowledge-sharing platforms.
This position is part of Amazon Development Centre Canada ULC and is located in Toronto, offering an opportunity to work with cutting-edge technology in machine learning acceleration while being part of AWS's larger mission to be Earth's Best Employer.