Software Engineer- AI/ML, AWS Neuron Distributed Training

AWS infrastructure provider specializing in silicon engineering, hardware design, software, and operations.
$129,300 - $223,600
Machine Learning
Mid-Level Software Engineer
In-Person
5,000+ Employees
3+ years of experience
AI · Enterprise SaaS

Description For Software Engineer- AI/ML, AWS Neuron Distributed Training

AWS Neuron is seeking a Software Development Engineer II to join its Machine Learning Applications team, focusing on distributed training solutions for the AWS Inferentia and Trainium cloud-scale ML accelerators. This role combines software development expertise with machine learning knowledge, working on cutting-edge AI infrastructure. You'll be responsible for developing and optimizing distributed training support across major frameworks such as PyTorch, TensorFlow, and JAX, while collaborating with chip architects and compiler engineers. The position offers exposure to large-scale ML models, including LLMs and vision transformers, making it ideal for engineers passionate about both software development and machine learning. Amazon provides a collaborative environment with a strong emphasis on work-life balance, mentorship, and career growth. The team is part of Annapurna Labs, which was acquired by AWS in 2015 and has delivered significant products including AWS Nitro, Graviton, and ML accelerators. This role offers competitive compensation and comprehensive benefits, reflecting Amazon's commitment to employee well-being and professional development.

Last updated 5 hours ago

Responsibilities For Software Engineer- AI/ML, AWS Neuron Distributed Training

  • Build distributed training support into PyTorch, TensorFlow, and JAX
  • Develop and maintain Neuron compiler and runtime stacks
  • Tune ML models for performance optimization
  • Work with chip architects and compiler engineers
  • Enable and performance-tune various ML model families, including LLMs

Requirements For Software Engineer- AI/ML, AWS Neuron Distributed Training

Python
  • 3+ years of non-internship professional software development experience
  • 3+ years of non-internship design or architecture experience
  • Experience programming with at least one software programming language
  • Deep Learning industry experience

Benefits For Software Engineer- AI/ML, AWS Neuron Distributed Training

  • Medical benefits
  • Financial benefits
  • Work-life balance
  • Mentorship & Career Growth
  • Employee-led affinity groups

Jobs Related To Amazon Software Engineer- AI/ML, AWS Neuron Distributed Training

Software Development Engineer II - DSO, Demand Science Optimization (DSO)

Software Development Engineer II position at Amazon's DSO team, focusing on ML-powered demand forecasting and supply management for Amazon Devices.

SDE II (Machine Learning), AGI Foundations

ML Engineer position at Amazon's AGI team focusing on LLM training and development, offering competitive salary and benefits in California locations.

Software Engineer- AI/ML, AWS Neuron

Software Engineer position at AWS Neuron team, focusing on ML infrastructure development and optimization for cloud-scale machine learning accelerators.

Software Development Engineer II, ML_AI

AWS SageMaker AI seeks SDE II to build next-gen AI platform, focusing on large-scale deep learning and distributed ML systems for global customers.

Software Engineer- AI/ML, AWS Neuron Machine Learning Distributed Training, ML Accuracy

Senior Software Engineer position at AWS Neuron team, focusing on distributed ML training systems and optimization for cloud-scale machine learning accelerators.