AWS Neuron is seeking a Senior Software Engineer to join its Machine Learning Inference Applications team, working on the complete software stack for AWS Inferentia and Trainium cloud-scale machine learning accelerators. This role offers an exciting opportunity to work at the cutting edge of LLM optimization and inference.
The position involves developing and optimizing core components of Large Language Model inference, including attention mechanisms, MLP layers, quantization techniques, speculative decoding, and Mixture of Experts. You'll work directly with massive models such as Llama 3.3 70B, Llama 3.1 405B, DBRX, and Mixtral, ensuring optimal performance and accuracy on Neuron devices.
What makes this role unique is the close collaboration with chip architects, compiler engineers, and runtime engineers, allowing you to influence the entire stack from hardware to software. The team culture strongly emphasizes knowledge-sharing and mentorship, with senior members providing one-on-one mentoring and thorough code reviews.
The role requires strong software development skills with at least 3 years of professional experience, a deep understanding of machine learning fundamentals, and hands-on experience with model optimization. Experience with PyTorch or JAX, particularly in deploying LLMs in production environments, is highly valued.
Amazon offers competitive compensation ranging from $129,300 to $223,600 based on location and experience, plus equity and comprehensive benefits. The position is based in Seattle, WA, offering the opportunity to work with one of the world's leading cloud providers in the rapidly evolving field of AI/ML infrastructure.
This is an excellent opportunity for engineers passionate about machine learning optimization who want to work on cutting-edge technology that powers some of the most advanced AI models in production today.