Software Engineer AI/ML, AWS Neuron Distributed Training Team

AWS is a leading cloud infrastructure company that acquired Annapurna Labs in 2015 to strengthen its infrastructure capabilities.
$99,500 - $200,000
Machine Learning
Mid-Level Software Engineer
In-Person
1+ year of experience
AI · Enterprise SaaS

Description For Software Engineer AI/ML, AWS Neuron Distributed Training Team

AWS Neuron is seeking a talented Software Engineer to join their Machine Learning Applications team, focusing on cloud-scale machine learning accelerators. This role is part of AWS's innovative Annapurna Labs division, which serves as the infrastructure backbone of AWS.

The position involves working with cutting-edge ML technologies, including large language models like GPT-2 and GPT-3, Stable Diffusion, and Vision Transformers. You'll be responsible for developing integrations with the Neuron SDK and major frameworks such as TensorFlow, PyTorch, and MXNet.

AWS offers a highly inclusive work environment with ten employee-led affinity groups and various innovative benefit offerings. The team strongly emphasizes work-life balance, believing that finding the right equilibrium between personal and professional life is crucial for long-term success and happiness.

The role provides excellent opportunities for growth and development, with a team structure that celebrates knowledge sharing and mentorship. Projects are assigned strategically to help team members develop into well-rounded professionals and take on increasingly complex challenges.

You'll be working with AWS Neuron, the complete software stack for AWS Inferentia and Trainium cloud-scale machine learning accelerators. This position offers the chance to work on significant projects that impact AWS's machine learning infrastructure and contribute to open-source projects.

The ideal candidate should have experience with ML infrastructure and systems, programming expertise in languages like C, C++, Java, or Perl, and familiarity with deep learning frameworks. The role offers competitive compensation based on location and experience, along with comprehensive benefits and potential equity compensation.

Join a team at the forefront of machine learning infrastructure, working on products that power AWS Nitro, ENA, EFA, Graviton and F1 EC2 instances; AWS Neuron, Inferentia, and Trainium ML accelerators; and scalable NVMe storage solutions.

Last updated 3 months ago

Responsibilities For Software Engineer AI/ML, AWS Neuron Distributed Training Team

  • Development, enablement and performance tuning of ML model families
  • Developing integrations with the Neuron SDK and frameworks
  • Planning and implementing new features
  • Working with customers to create innovative solutions
  • Contributing to open source projects

Requirements For Software Engineer AI/ML, AWS Neuron Distributed Training Team

Python
Java
  • B.S. in Computer Science or a related technical field
  • 1+ years of experience in ML infrastructure and systems
  • Experience with programming languages: C, C++, Java, or Perl
  • Experience with deep learning frameworks: TensorFlow, PyTorch, and MXNet

Benefits For Software Engineer AI/ML, AWS Neuron Distributed Training Team

Medical Insurance
  • Flexible working hours
  • Work-life balance focus
  • Mentorship opportunities
  • Career growth opportunities
  • Medical benefits
  • Financial benefits
