AWS Neuron is seeking a Senior Software Engineer to join its Machine Learning Inference Applications team. This role focuses on developing and optimizing core components of Large Language Model (LLM) inference for AWS Inferentia and Trainium cloud-scale machine learning accelerators. The position involves working with cutting-edge LLM technology, including attention mechanisms, multilayer perceptron (MLP) layers, quantization, and speculative decoding.
The successful candidate will collaborate closely with chip architects, compiler engineers, and runtime engineers to maximize performance and accuracy across models such as Llama 3.3 70B, Llama 3.1 405B, DBRX, and Mixtral. The team emphasizes knowledge-sharing and mentorship, providing opportunities for career growth through challenging projects and supportive code reviews.
This role offers competitive compensation ranging from $129,300 to $223,600, depending on location and experience, plus additional benefits including equity, sign-on payments, and comprehensive medical coverage. The position is based in Seattle, WA, and requires at least 3 years of professional software development experience along with strong fundamentals in machine learning model architecture and optimization.
Amazon's commitment to innovation in AI/ML technology, combined with the team's collaborative culture and focus on personal development, makes this an excellent opportunity for engineers passionate about advancing the field of machine learning inference at scale.