Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference

Amazon

Amazon Web Services (AWS) builds cloud computing solutions and custom machine learning accelerators.

Cupertino, CA, USA

$129,300 - $223,600

Machine Learning

Senior Software Engineer

In-Person

5,000+ Employees

5+ years of experience

AI · Enterprise SaaS

Job Description

The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. This role is part of the Inference Enablement and Acceleration team, which is at the forefront of running a wide range of models, supporting novel architectures, and maximizing their performance on AWS's custom ML accelerators.

The position involves working across multiple technology layers, from frameworks and kernels to compiler, runtime, and collectives. You'll be responsible for development, enablement, and performance tuning of various LLM families, including massive-scale models like the Llama family, DeepSeek, and beyond.

As a Senior Software Development Engineer, you'll architect and implement business-critical features, mentor experienced engineers, and work in a unique learning culture where innovation and experimentation are encouraged. The role combines deep hardware knowledge with ML expertise to push the boundaries of AI acceleration technology.

Key responsibilities include building distributed inference support for PyTorch in the Neuron SDK, tuning models for highest performance, and maximizing efficiency on AWS Trainium and Inferentia silicon and servers. You'll collaborate with cross-functional teams, work directly with customers, and contribute to future architecture designs.

The team operates in a startup-like environment where you'll always work on the most important initiatives. We emphasize collaboration, technical ownership, and continuous learning. Our inclusive culture supports knowledge-sharing and mentorship, with opportunities for career growth through increasingly complex technical challenges.

The position offers competitive compensation ranging from $129,300 to $223,600 per year based on geographic location, plus equity, sign-on payments, and comprehensive benefits. Join us to solve some of the most interesting and impactful infrastructure challenges in AI/ML today.

Last updated 9 days ago

Responsibilities For Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference

Design, develop, and optimize machine learning models and frameworks for deployment on custom ML hardware accelerators
Participate in all stages of the ML system development lifecycle
Build infrastructure to systematically analyze and onboard multiple models
Design and implement high-performance kernels for ML operations
Analyze and optimize system-level performance
Conduct detailed performance analysis using profiling tools
Implement optimizations such as fusion, sharding, tiling, and scheduling
Conduct comprehensive testing and continuous deployment
Work directly with customers to enable and optimize ML models
Collaborate across teams to develop innovative optimization techniques

Requirements For Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference

Python

Bachelor's degree in computer science or equivalent
5+ years of non-internship professional software development experience
5+ years of non-internship design or architecture experience
Knowledge of machine learning and LLM fundamentals
Software development experience in C++ and Python
Strong understanding of system performance, memory management, and parallel computing principles
Proficiency in debugging, profiling, and implementing best software engineering practices

Benefits For Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference

Medical Insurance

Dental Insurance

Vision Insurance

401k

