Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference

Amazon is a global technology leader in cloud computing, artificial intelligence, and e-commerce.
$151,300 - $261,500
Machine Learning
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS

Description For Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference

AWS Neuron is seeking a Senior Software Development Engineer to join its Machine Learning Inference Model Enablement team. This role focuses on developing and optimizing large-scale machine learning models, particularly LLMs such as the Llama family and DeepSeek, for AWS's cloud infrastructure.

The position involves working with AWS's proprietary Inferentia and Trainium accelerators, requiring expertise in both software development and machine learning optimization. You'll collaborate closely with compiler and runtime engineers to create distributed inference solutions, using technologies like Python, PyTorch, and JAX.

As a senior engineer, you'll lead initiatives to build distributed inference support for PyTorch in the Neuron SDK, focusing on maximizing performance and efficiency for customer workloads. The role demands strong software development skills in Python and deep knowledge of machine learning systems.

The team operates in a startup-like environment, prioritizing high-impact solutions for AWS's large customer base. You'll participate in design discussions, code reviews, and cross-functional collaboration while working with cutting-edge ML infrastructure.

Amazon offers competitive compensation, including a base salary range of $151,300 to $261,500 depending on location, plus equity and comprehensive benefits. The position is based in Cupertino, CA, and offers opportunities for career growth through mentorship and hands-on experience with advanced ML systems.

This role is ideal for experienced engineers passionate about machine learning infrastructure who want to impact how large-scale AI models are deployed and optimized in production environments. Join a team that values knowledge-sharing, mentorship, and technical excellence while working on some of the most challenging problems in ML infrastructure.

Last updated 2 days ago

Responsibilities For Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference

  • Lead efforts to build distributed inference support for PyTorch in the Neuron SDK
  • Tune models for highest performance on AWS Trainium and Inferentia silicon and servers
  • Create metrics and implement automation improvements
  • Identify and resolve the root causes of software defects
  • Participate in design discussions and code reviews
  • Work cross-functionally to drive business decisions with technical input

Requirements For Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference

  • 5+ years of non-internship professional software development experience
  • 5+ years of system design and architecture experience
  • Knowledge of machine learning and LLM architecture, and of training and inference lifecycles
  • Experience with model execution optimizations
  • Experience programming in at least one programming language
  • Experience with Python and ML development

Benefits For Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference

  • Comprehensive medical benefits
  • 401k plan
  • Equity compensation
  • Mentorship opportunities
  • Career growth opportunities
