Taro Logo

Software Engineer-AI/ML, AWS Neuron Inference

Amazon is a global technology company that provides cloud computing, e-commerce, AI, and digital streaming services.
$129,300 - $223,600
Machine Learning
Senior Software Engineer
In-Person
5,000+ Employees
3+ years of experience
AI · Enterprise SaaS

Description For Software Engineer-AI/ML, AWS Neuron Inference

AWS Neuron is seeking a Senior Software Engineer to join their Machine Learning Inference Applications team. This role focuses on developing and optimizing core components of Large Language Model (LLM) inference for AWS Inferentia and Trainium cloud-scale machine learning accelerators. The position involves working with cutting-edge LLM technology, including attention mechanisms, MLP, quantization, and speculative decoding.

The successful candidate will collaborate closely with chip architects, compiler engineers, and runtime engineers to maximize performance and accuracy across various models like Llama 3.3 70B, 3.1 405B, DBRX, and Mixtral. The team emphasizes knowledge-sharing and mentorship, providing opportunities for career growth through challenging projects and supportive code reviews.

This role offers competitive compensation ranging from $129,300 to $223,600 based on location and experience, plus additional benefits including equity, sign-on payments, and comprehensive medical coverage. The position is based in Seattle, WA, and requires at least 3 years of professional software development experience with strong fundamentals in machine learning model architecture and optimization.

Amazon's commitment to innovation in AI/ML technology, combined with the team's collaborative culture and focus on personal development, makes this an excellent opportunity for engineers passionate about advancing the field of machine learning inference at scale.

Last updated a day ago

Responsibilities For Software Engineer-AI/ML, AWS Neuron Inference

  • Development and performance optimization of core building blocks of LLM Inference
  • Working with chip architects, compiler engineers and runtime engineers
  • Adapting latest research in LLM optimization to Neuron chips
  • Performance optimization for models like Llama 3.3 70B, 3.1 405B, DBRX, Mixtral

Requirements For Software Engineer-AI/ML, AWS Neuron Inference

Python
  • 3+ years of non-internship professional software development experience
  • 2+ years of non-internship design or architecture experience
  • Experience programming with at least one software programming language
  • Fundamentals of Machine learning models, their architecture, training and inference lifecycles

Benefits For Software Engineer-AI/ML, AWS Neuron Inference

Medical Insurance
401k
Equity
  • Medical benefits
  • Financial benefits
  • Equity compensation
  • Sign-on payments

Interested in this job?

Jobs Related To Amazon Software Engineer-AI/ML, AWS Neuron Inference