Software Development Engineer - AI/ML, AWS Neuron, Multimodal Inference

Amazon is a global technology company providing cloud computing, e-commerce, AI, and digital streaming services.
$129,300 - $223,600
Machine Learning
Mid-Level Software Engineer
In-Person
5,000+ Employees
3+ years of experience
AI · Enterprise SaaS

Description For Software Development Engineer - AI/ML, AWS Neuron, Multimodal Inference

AWS Neuron is seeking a Software Development Engineer to join their Machine Learning Applications (ML Apps) team, focusing on the complete software stack for AWS Inferentia and Trainium cloud-scale machine learning accelerators. This role is central to developing and optimizing performance for various ML model families, including large language models like Llama2, GPT2, GPT3, as well as stable diffusion and Vision Transformers.

The position involves working closely with compiler and runtime engineers to create distributed inference solutions using Trn1. You'll be responsible for optimizing inference performance for both latency and throughput on large models using Python, PyTorch, and JAX. Experience with Deepspeed and other distributed inference libraries is essential.

As part of a startup-like development environment, you'll build high-impact solutions for a large customer base, participate in design discussions, conduct code reviews, and collaborate with internal and external stakeholders. The team emphasizes knowledge-sharing and mentorship, providing opportunities for career growth through increasingly complex technical challenges.

The role combines deep technical expertise in machine learning systems with practical software engineering, requiring strong skills in C++/Python and comprehensive ML knowledge. You'll be working at the forefront of ML infrastructure, helping to shape the future of cloud-based machine learning acceleration.

Amazon offers a competitive compensation package including base pay ranging from $129,300 to $223,600 depending on location, plus equity, sign-on payments, and comprehensive benefits. Join a team that's dedicated to innovation and technical excellence in the rapidly evolving field of machine learning infrastructure.

Last updated 2 days ago

Responsibilities For Software Development Engineer - AI/ML, AWS Neuron, Multimodal Inference

  • Lead efforts building distributed inference support into PyTorch and TensorFlow
  • Tune ML models for highest performance on AWS Trainium and Inferentia silicon
  • Design and code solutions for software architecture efficiency
  • Create metrics and implement automation improvements
  • Resolve root cause of software defects
  • Participate in design discussions and code reviews
  • Work cross-functionally with stakeholders

Requirements For Software Development Engineer - AI/ML, AWS Neuron, Multimodal Inference

Python
Java
  • 3+ years of non-internship professional software development experience
  • 2+ years of system design and architecture experience
  • Experience programming with at least one software programming language
  • Experience with Python, PyTorch or JAX
  • Knowledge of distributed inference libraries like Deepspeed
  • Strong software development skills in C++/Python
  • ML knowledge and experience

Benefits For Software Development Engineer - AI/ML, AWS Neuron, Multimodal Inference

Medical Insurance
401k
  • Full range of medical benefits
  • Financial benefits
  • Equity compensation
  • Sign-on payments
  • Total compensation package

Interested in this job?

Jobs Related To Amazon Software Development Engineer - AI/ML, AWS Neuron, Multimodal Inference

Software Development Engineer, Finance Technology

Software Development Engineer position at Amazon's FinTech team, focusing on building AI/ML applications for financial data processing and analysis.

Machine Learning Engineer, Gen AI Innovation Center, AWS

AWS Machine Learning Engineer position focusing on developing and optimizing Large Language Models, working with enterprise customers to implement generative AI solutions.

Software Development Engineer, Alexa Customer Journeys

Software Development Engineer role at Amazon's Alexa Customer Journeys team, focusing on Generative AI and LLM technologies to enhance Alexa's capabilities and user experience.

Machine Learning Engineer II, Shopping Conversation Foundation

Machine Learning Engineer II position at Amazon focusing on building and optimizing large language models for conversational shopping experiences, offering competitive compensation and benefits.

Machine Learning Engineer, CreativeX

Machine Learning Engineer position at Amazon's CreativeX team, focusing on real-time ad personalization using advanced AI technologies including LLMs and computer vision.