Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference

Amazon Web Services (AWS) builds cloud computing infrastructure and machine learning solutions used by millions of customers globally.
$129,300 - $223,600
Machine Learning
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS

Job Description

The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. The Inference Enablement and Acceleration team is at the forefront of running a wide range of models and supporting novel architectures while maximizing their performance on AWS's custom ML accelerators.

As a Senior Software Development Engineer, you will work across multiple technology layers, from frameworks and kernels to the compiler, runtime, and collectives. You'll be responsible for the development, enablement, and performance tuning of various LLM families, including massive-scale models. The role combines deep hardware knowledge with ML expertise to push the boundaries of AI acceleration.

Key responsibilities include architecting and implementing business-critical features, mentoring experienced engineers, and working directly with customers on model enablement. You'll collaborate with compiler engineers and runtime engineers to create, build and tune distributed inference solutions with Trainium and Inferentia.

The team operates in a startup-like development environment, emphasizing collaboration, technical ownership, and continuous learning. You'll work at the intersection of machine learning, high-performance computing, and distributed architectures, helping shape the future of AI acceleration technology.

The position offers competitive compensation ranging from $129,300 to $223,600 per year based on location, plus equity and comprehensive benefits. This is an excellent opportunity for someone passionate about AI/ML infrastructure and optimization who wants to make a significant impact in the field of machine learning acceleration.

Last updated 3 days ago

Responsibilities For Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference

  • Design, develop, and optimize machine learning models and frameworks for deployment on custom ML hardware accelerators
  • Participate in all stages of the ML system development lifecycle
  • Build infrastructure to systematically analyze and onboard multiple models
  • Design and implement high-performance kernels and features for ML operations
  • Analyze and optimize system-level performance across multiple generations of Neuron hardware
  • Conduct detailed performance analysis using profiling tools
  • Implement optimizations such as fusion, sharding, tiling, and scheduling
  • Work directly with customers to enable and optimize their ML models
  • Collaborate across teams to develop innovative optimization techniques

Requirements For Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference

  • Bachelor's degree in computer science or equivalent
  • 5+ years of non-internship professional software development experience
  • 5+ years of non-internship design or architecture experience
  • Knowledge of machine learning and LLM fundamentals
  • Software development experience in C++ and Python
  • Strong understanding of system performance, memory management, and parallel computing principles
  • Proficiency in debugging, profiling, and implementing best software engineering practices

Benefits For Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference

  • Competitive base salary
  • Equity compensation
  • Medical benefits
  • 401k retirement plan
  • Comprehensive benefits package

Related Jobs

Sr. Software Engineer (ML), AGI Foundations

Senior Software Engineering role at Amazon focusing on Machine Learning and AGI, leading development of inference and evaluation systems for the Nova family of models.

Sr. Software Dev. Engineer/MLE, AGI Customization

Senior Machine Learning Engineer role at Amazon's AGI team, focusing on LLM customization, fine-tuning, and model distillation, requiring 5+ years of software development experience.

Sr Software Dev Engineer, Deep Learning Compilers

Senior Software Engineer role at Amazon focusing on deep learning compiler development for Neural Edge processors.

Software Engineer-AI/ML, AWS Neuron Inference

Senior Software Engineering role at AWS Neuron team focusing on ML inference optimization and LLM performance tuning.
