Taro Logo

Software Engineer-AI/ML, AWS Neuron Inference

Amazon is a global technology company that provides cloud computing, artificial intelligence, e-commerce and digital streaming services.
$129,300 - $223,600
Machine Learning
Senior Software Engineer
In-Person
5,000+ Employees
3+ years of experience
AI · Enterprise SaaS

Job Description

AWS Neuron is seeking a Senior Software Engineer to join their Machine Learning Inference Applications team, working on the complete software stack for AWS Inferentia and Trainium cloud-scale machine learning accelerators. This role offers an exciting opportunity to work at the cutting edge of LLM optimization and inference.

The position involves developing and optimizing core components of Large Language Model inference, including Attention mechanisms, MLP networks, Quantization techniques, Speculative Decoding, and Mixture of Experts. You'll work directly with massive models like Llama 3.3 70B, 3.1 405B, DBRX, and Mixtral, ensuring optimal performance and accuracy on Neuron devices.

What makes this role unique is the close collaboration with chip architects, compiler engineers, and runtime engineers, allowing you to influence the entire stack from hardware to software. The team culture strongly emphasizes knowledge-sharing and mentorship, with senior members providing one-on-one mentoring and thorough code reviews.

The role requires strong software development skills with at least 3 years of professional experience, deep understanding of machine learning fundamentals, and hands-on experience with model optimization. Experience with PyTorch or Jax, particularly in deploying LLMs in production environments, is highly valued.

Amazon offers competitive compensation ranging from $129,300 to $223,600 based on location and experience, plus equity and comprehensive benefits. The position is based in Seattle, WA, offering the opportunity to work with one of the world's leading cloud providers in a rapidly evolving field of AI/ML infrastructure.

This is an excellent opportunity for engineers passionate about machine learning optimization who want to work on cutting-edge technology that powers some of the most advanced AI models in production today.

Last updated 9 days ago

Responsibilities For Software Engineer-AI/ML, AWS Neuron Inference

  • Development and performance optimization of core building blocks of LLM Inference
  • Work on Attention, MLP, Quantization, Speculative Decoding, Mixture of Experts
  • Collaborate with chip architects, compiler engineers and runtime engineers
  • Adapt latest research in LLM optimization to Neuron chips
  • Work on performance optimization for models like Llama 3.3 70B, 3.1 405B, DBRX, Mixtral

Requirements For Software Engineer-AI/ML, AWS Neuron Inference

Python
  • 3+ years of non-internship professional software development experience
  • 2+ years of non-internship design or architecture experience
  • Experience programming with at least one software programming language
  • Fundamentals of Machine learning models knowledge
  • Experience with model architecture, training and inference lifecycles
  • Experience with model performance optimization

Benefits For Software Engineer-AI/ML, AWS Neuron Inference

Medical Insurance
  • Medical benefits
  • Financial benefits
  • Comprehensive benefits package

Related Jobs

Sr. Software Engineer (ML), AGI Foundations

Senior Software Engineering role at Amazon focusing on Machine Learning and AGI, leading development of inference and evaluation systems for the Nova family of models.

Sr. Software Dev. Engineer/MLE, AGI Customization

Senior Machine Learning Engineer role at Amazon's AGI team, focusing on LLM customization, fine-tuning, and model distillation, requiring 5+ years of software development experience.

Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference

Senior Software Development Engineer position at AWS focusing on AI/ML acceleration, working on the Neuron SDK for deep learning and GenAI workload optimization.

Sr Software Dev Engineer, Deep Learning Compilers

Senior Software Engineer role at Amazon focusing on deep learning compiler development for Neural Edge processors.

Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference

Senior Software Development Engineer role at AWS focusing on AI/ML acceleration, working on AWS Neuron SDK to optimize deep learning and GenAI workloads for custom ML accelerators.