Taro Logo

Sr. Machine Learning - Compiler Engineer III, AWS Neuron, Annapurna Labs

AWS is the world's most comprehensive and broadly adopted cloud platform, pioneering cloud computing and continuously innovating.
$151,300 - $261,500
Machine Learning
Staff Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS
This job posting may no longer be active. You may be interested in these related jobs instead:
Sr ML Compiler Engineer, Annapurna Labs

Senior Machine Learning Compiler Engineer role at Amazon's Annapurna Labs, focusing on developing cutting-edge ML compiler technology for AWS ML accelerators and neural network optimization.

Sr SOC Verification Engineer, Annapurna ML

Senior SOC Verification Engineer role at AWS, focusing on custom chip design verification for cloud-scale machine learning acceleration.

Sr. Amazon Q Delivery Engineer, Amazon Q Customer Success Team (Q-CST)

Senior Amazon Q Delivery Engineer position focusing on implementing enterprise-scale Generative AI solutions using Amazon Q and AWS AI/ML services, requiring 7+ years of cloud architecture experience.

Sr. Deep Learning Compiler Engineer III, AWS Neuron, Annapurna Labs

Senior Deep Learning Compiler Engineer role at AWS Neuron team, focusing on ML infrastructure optimization, compiler development, and technical leadership in a collaborative environment.

Senior SoC Architectural Modeling Engineer, AWS Machine Learning Accelerators

Senior SoC Modeling Engineer position at AWS, developing functional models for machine learning accelerator chips, requiring expertise in C++/SystemC and hardware modeling.

Description For Sr. Machine Learning - Compiler Engineer III, AWS Neuron, Annapurna Labs

Do you want to be part of AI revolution? At AWS our vision is to make deep learning pervasive for everyday developers and to democratize access to cutting-edge infrastructure. AWS Neuron is the SDK that optimizes the performance of complex ML models executed on AWS Inferentia and Trainium, our custom chips designed to accelerate deep-learning workloads.

As a Senior Machine Learning Compiler Engineer in the AWS Neuron team, you will be responsible for building next generation Neuron compiler which transforms ML models written in ML frameworks (e.g., PyTorch, TensorFlow, and JAX) to be deployed on AWS Inferentia and Trainium based servers in the Amazon cloud.

Your role will involve solving complex compiler optimization problems to achieve optimum performance for various ML model families, including massive scale large language models like Llama, Deepseek, and beyond, as well as stable diffusion and vision transformers. You'll need to understand these models inside-out to make informed decisions on compiler optimizations.

Key Responsibilities:

  • Design and implement innovative software solutions for the Neuron compiler
  • Collaborate with chip architects, runtime engineers, and ML teams
  • Work with open-source communities to influence industry-wide standards
  • Create compiler optimization and verification passes
  • Build developer-facing features for AWS accelerators
  • Analyze and resolve compiler defects

The team operates in a startup-like environment, focusing on high-impact projects. We value knowledge-sharing and mentorship, with senior members providing one-on-one guidance and thorough code reviews. Career growth is emphasized through challenging projects and continuous learning opportunities.

AWS offers comprehensive benefits including medical insurance, 401k, parental leave, and more. We embrace diversity through employee-led affinity groups and foster an inclusive culture that celebrates our differences.

This is an opportunity to work at the forefront of AI infrastructure, helping shape the future of machine learning acceleration while being part of AWS's innovative culture. Join us in democratizing access to cutting-edge AI infrastructure and making deep learning accessible to developers worldwide.

Required Qualifications:

  • 5+ years of professional software development experience
  • Strong programming skills in object-oriented languages
  • Experience leading technical designs and architectures
  • Full software development lifecycle expertise
  • Leadership experience as a mentor or tech lead

Preferred Qualifications:

  • Advanced degree in Computer Science
  • Compiler design experience
  • Familiarity with ML frameworks and tools
  • Open-source compiler expertise
Last updated 19 days ago

Responsibilities For Sr. Machine Learning - Compiler Engineer III, AWS Neuron, Annapurna Labs

  • Design, implement, test, deploy and maintain innovative software solutions for Neuron compiler
  • Work with chip architects, runtime/OS engineers, and ML Apps teams
  • Collaborate with open-source communities
  • Create compiler optimization and verification passes
  • Build features to surface AWS accelerator capabilities to developers
  • Implement tools to analyze numerical errors
  • Resolve compiler defects
  • Participate in design discussions and code reviews
  • Communicate with internal and external stakeholders

Requirements For Sr. Machine Learning - Compiler Engineer III, AWS Neuron, Annapurna Labs

Python
Java
  • 5+ years of non-internship professional software development experience
  • 5+ years of programming with at least one software programming language
  • 5+ years of leading design or architecture experience
  • 5+ years of full software development life cycle experience
  • Experience as a mentor, tech lead or leading an engineering team
  • Master's degree or PhD in Computer Science or related field (preferred)
  • Experience in compiler design for CPU/GPU/Vector engines/ML-accelerators (preferred)
  • Experience with OpenSource compiler toolset like LLVM/MLIR (preferred)
  • Experience with PyTorch, OpenXLA, StableHLO, JAX, TVM (preferred)

Interested in this job?