Senior Software Engineer, AWS Neuron Inference

Amazon is a global technology company leading in e-commerce, cloud computing, AI, and digital streaming.
$151,300 - $261,500
Machine Learning
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS

Description For Senior Software Engineer, AWS Neuron Inference

AWS Neuron is at the forefront of cloud-scale machine learning acceleration, providing the complete software stack for AWS Inferentia and Trainium accelerators. This Senior Software Engineering role is part of the Machine Learning Inference Applications team, where you'll be instrumental in pushing the boundaries of LLM performance optimization.

The position offers an exciting opportunity to work with cutting-edge technology, focusing on the development and optimization of core LLM inference components including Attention, MLP, Quantization, and Speculative Decoding. You'll be working with state-of-the-art models like Llama 3.3 70B, 3.1 405B, DBRX, and Mixtral, ensuring they perform optimally on Neuron devices.

What makes this role particularly appealing is the collaborative nature of the work - you'll be working directly with chip architects, compiler engineers, and runtime engineers, bridging the gap between hardware capabilities and software optimization. The team culture strongly emphasizes knowledge-sharing and mentorship, making it an ideal environment for both personal and professional growth.

The compensation package is highly competitive, ranging from $151,300 to $261,500 based on location and experience, plus additional benefits including equity, sign-on bonuses, and comprehensive medical coverage. Amazon's total compensation approach ensures you're well-rewarded for your contributions.

The role requires strong technical expertise with at least 5 years of software development experience and a deep understanding of machine learning fundamentals. You'll be joining a team that values both technical excellence and collaborative spirit, working on projects that directly impact the performance of AWS's machine learning infrastructure.

This position offers the unique opportunity to work at the intersection of machine learning and high-performance computing, making a significant impact on how AI models are deployed and optimized in production environments. If you're passionate about pushing the boundaries of ML performance and working with cutting-edge technology, this role provides the perfect platform to advance your career while contributing to groundbreaking developments in AI acceleration.

Last updated 6 hours ago

Responsibilities For Senior Software Engineer, AWS Neuron Inference

  • Development and performance optimization of core building blocks of LLM Inference
  • Work on Attention, MLP, Quantization, Speculative Decoding, Mixture of Experts
  • Collaborate with chip architects, compiler engineers and runtime engineers
  • Adapt latest research in LLM optimization to Neuron chips
  • Work across teams and organizations

Requirements For Senior Software Engineer, AWS Neuron Inference

Python
Java
  • 5+ years of full software development life cycle experience
  • Bachelor's degree in computer science or equivalent
  • 5+ years of programming using modern programming languages (Java, C++, or C#)
  • Fundamentals of Machine learning models knowledge
  • Experience with object-oriented design

Benefits For Senior Software Engineer, AWS Neuron Inference

Medical Insurance
401k
  • Full range of medical benefits
  • Financial benefits
  • 401k
  • Equity compensation
  • Sign-on payments
  • Mentorship opportunities
  • Career growth opportunities

Interested in this job?

Jobs Related To Amazon Senior Software Engineer, AWS Neuron Inference

Senior Delivery Consultant - Application Developer, Data & Machine Learning, WWPS ProServe

Senior ML and cloud architecture role at AWS ProServe, combining technical expertise with consulting to help customers implement AWS solutions, focusing on machine learning and data processing systems.

Sr. Machine Learning Engineer, Amazon Q in QuickSight

Senior Machine Learning Engineer position at Amazon working on Q in QuickSight, focusing on LLM and NLP applications for business intelligence solutions.

Senior Software Development Engineer - Amazon Music Machine Learning

Senior Software Engineer role at Amazon Music focusing on machine learning and personalization systems to enhance music discovery and recommendations for millions of users globally.

Senior Software Development Engineer, Sponsored Products

Senior Software Development Engineer position at Amazon Ads, focusing on machine learning and large-scale systems for Sponsored Products, offering competitive compensation and growth opportunities.

Sr. Software Engineer- AI/ML, AWS Neuron Distributed Training

Senior ML Engineer role at Amazon's Annapurna Labs, focusing on distributed training development for AWS Neuron ML accelerators, working with cutting-edge AI models and custom silicon.