AWS Neuron is seeking a Software Development Engineer to join its Machine Learning Applications (ML Apps) team, which owns the complete software stack for the AWS Inferentia and Trainium cloud-scale machine learning accelerators. The role is central to developing and optimizing performance across a range of ML model families, including large language models such as Llama 2, GPT-2, and GPT-3, as well as Stable Diffusion and Vision Transformers.
The position involves working closely with compiler and runtime engineers to build distributed inference solutions on Trn1 instances. You'll be responsible for optimizing inference on large models for both latency and throughput using Python, PyTorch, and JAX. Experience with DeepSpeed and other distributed inference libraries is essential.
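To give a flavor of the latency-versus-throughput tension this role centers on, here is a minimal, purely illustrative sketch (not Neuron- or DeepSpeed-specific; all cost numbers are hypothetical) of how request batching trades per-request latency for aggregate throughput:

```python
# Illustrative cost model: each batch pays a fixed overhead plus a
# per-item cost. Larger batches amortize the overhead (throughput up)
# but every request waits for its whole batch (latency up).
# All timing constants below are made-up defaults for illustration.

def serve(num_requests, batch_size, per_batch_overhead_ms=10.0, per_item_ms=2.0):
    """Return (avg_latency_ms, throughput_req_per_s) for batched serving."""
    batches = -(-num_requests // batch_size)  # ceiling division
    total_ms = batches * (per_batch_overhead_ms + batch_size * per_item_ms)
    avg_latency_ms = total_ms / batches       # a request waits for its batch
    throughput = num_requests / (total_ms / 1000.0)
    return avg_latency_ms, throughput

if __name__ == "__main__":
    for bs in (1, 8, 32):
        lat, thr = serve(num_requests=1024, batch_size=bs)
        print(f"batch={bs:3d}  latency~{lat:6.1f} ms  throughput~{thr:7.1f} req/s")
```

Under this toy model, batch size 1 minimizes latency while batch size 32 maximizes throughput; real accelerator workloads navigate the same tradeoff with far more dimensions (sharding, memory bandwidth, kernel fusion).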
As part of a startup-like development environment, you'll build high-impact solutions for a large customer base, participate in design discussions, conduct code reviews, and collaborate with internal and external stakeholders. The team emphasizes knowledge-sharing and mentorship, providing opportunities for career growth through increasingly complex technical challenges.
The role combines deep technical expertise in machine learning systems with practical software engineering, requiring strong skills in C++/Python and comprehensive ML knowledge. You'll be working at the forefront of ML infrastructure, helping to shape the future of cloud-based machine learning acceleration.
Amazon offers a competitive compensation package including base pay ranging from $129,300 to $223,600 depending on location, plus equity, sign-on payments, and comprehensive benefits. Join a team that's dedicated to innovation and technical excellence in the rapidly evolving field of machine learning infrastructure.