Taro Logo

Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference

Amazon Web Services (AWS) builds cloud computing solutions and custom machine learning accelerators.
$129,300 - $223,600
Machine Learning
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS

Job Description

The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. This role is part of the Inference Enablement and Acceleration team, which is at the forefront of running a wide range of models and supporting novel architecture alongside maximizing their performance for AWS's custom ML accelerators.

The position involves working across multiple technology layers - from frameworks and kernels to compiler, runtime, and collectives. You'll be responsible for development, enablement, and performance tuning of various LLM model families, including massive scale large language models like the Llama family, DeepSeek, and beyond.

As a Senior Software Development Engineer, you'll architect and implement business-critical features, mentor experienced engineers, and work in a unique learning culture where innovation and experimentation are encouraged. The role combines deep hardware knowledge with ML expertise to push the boundaries of AI acceleration technology.

Key responsibilities include building distributed inference support for PyTorch in the Neuron SDK, tuning models for highest performance, and maximizing efficiency on AWS Trainium and Inferentia silicon and servers. You'll collaborate with cross-functional teams, work directly with customers, and contribute to future architecture designs.

The team operates in a startup-like environment where you'll always work on the most important initiatives. We emphasize collaboration, technical ownership, and continuous learning. Our inclusive culture supports knowledge-sharing and mentorship, with opportunities for career growth through increasingly complex technical challenges.

The position offers competitive compensation ranging from $129,300 to $223,600 per year based on geographic location, plus equity, sign-on payments, and comprehensive benefits. Join us to solve some of the most interesting and impactful infrastructure challenges in AI/ML today.

Last updated 9 days ago

Responsibilities For Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference

  • Design, develop, and optimize machine learning models and frameworks for deployment on custom ML hardware accelerators
  • Participate in all stages of ML system development lifecycle
  • Build infrastructure to systematically analyze and onboard multiple models
  • Design and implement high-performance kernels for ML operations
  • Analyze and optimize system-level performance
  • Conduct detailed performance analysis using profiling tools
  • Implement optimizations such as fusion, sharding, tiling, and scheduling
  • Conduct comprehensive testing and continuous deployment
  • Work directly with customers to enable and optimize ML models
  • Collaborate across teams to develop innovative optimization techniques

Requirements For Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference

Python
  • Bachelor's degree in computer science or equivalent
  • 5+ years of non-internship professional software development experience
  • 5+ years of non-internship design or architecture experience
  • Fundamentals of Machine learning and LLMs knowledge
  • Software development experience in C++, Python
  • Strong understanding of system performance, memory management, and parallel computing principles
  • Proficiency in debugging, profiling, and implementing best software engineering practices

Benefits For Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference

Medical Insurance
Dental Insurance
Vision Insurance
401k
  • Medical Insurance
  • Dental Insurance
  • Vision Insurance
  • 401k

Related Jobs

Sr. Software Engineer (ML), AGI Foundations

Senior Software Engineering role at Amazon focusing on Machine Learning and AGI, leading development of inference and evaluation systems for the Nova family of models.

Sr. Software Dev. Engineer/MLE, AGI Customization

Senior Machine Learning Engineer role at Amazon's AGI team, focusing on LLM customization, fine-tuning, and model distillation, requiring 5+ years of software development experience.

Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference

Senior Software Development Engineer position at AWS focusing on AI/ML acceleration, working on the Neuron SDK for deep learning and GenAI workload optimization.

Sr Software Dev Engineer, Deep Learning Compilers

Senior Software Engineer role at Amazon focusing on deep learning compiler development for Neural Edge processors.

Software Engineer-AI/ML, AWS Neuron Inference

Senior Software Engineering role at AWS Neuron team focusing on ML inference optimization and LLM performance tuning.