
Sr. ML Kernel Performance Engineer, AWS Neuron, Annapurna Labs

Amazon Web Services (AWS) is the world's most comprehensive and broadly adopted cloud platform, pioneering cloud computing and continuous innovation.
Machine Learning
Staff Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS

Description For Sr. ML Kernel Performance Engineer, AWS Neuron, Annapurna Labs

The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. The Acceleration Kernel Library team focuses on maximizing performance on these accelerators. Working at the hardware-software boundary, its engineers write high-performance kernels for ML functions so that customer workloads run as efficiently as possible.

The role involves working at the intersection of machine learning, high-performance computing, and distributed architectures. As a Sr. ML Kernel Performance Engineer, you'll be responsible for architecting and implementing business-critical features, publishing cutting-edge research, and mentoring experienced engineers. The team operates in a startup-like environment with small, agile teams focused on innovation and experimentation.

Key responsibilities include designing and implementing high-performance compute kernels, optimizing performance across multiple hardware generations, conducting detailed performance analysis, and working directly with customers to optimize their ML models. You'll collaborate across compiler, runtime, framework, and hardware teams to deliver optimal performance for machine learning workloads.

The position offers unique opportunities to:

  • Work on cutting-edge AI acceleration technology
  • Contribute to future architecture designs
  • Engage directly with customers for model optimization
  • Participate in a culture of innovation and experimentation
  • Work with a comprehensive ML toolkit including compiler, runtime, and framework components

AWS values diverse experiences and an inclusive culture, and offers flexible working hours and strong support for work-life balance. The team provides extensive mentorship and focuses on career growth through challenging projects and knowledge sharing.

The role is part of AWS's larger mission to pioneer cloud computing and to keep pushing the boundaries of AI acceleration. You'll join a team that combines deep hardware knowledge with ML expertise to deliver solutions for the most demanding AI workloads.

Last updated 15 days ago

Responsibilities For Sr. ML Kernel Performance Engineer, AWS Neuron, Annapurna Labs

  • Design and implement high-performance compute kernels for ML operations
  • Analyze and optimize kernel-level performance across multiple generations of Neuron hardware
  • Conduct detailed performance analysis using profiling tools
  • Implement compiler optimizations
  • Work directly with customers to enable and optimize their ML models
  • Collaborate across teams to develop innovative kernel optimization techniques

Requirements For Sr. ML Kernel Performance Engineer, AWS Neuron, Annapurna Labs

  • Python
  • 5+ years of non-internship professional software development experience
  • 5+ years of programming with at least one software programming language
  • 5+ years of leading design or architecture experience
  • Experience as a mentor, tech lead or leading an engineering team

Benefits For Sr. ML Kernel Performance Engineer, AWS Neuron, Annapurna Labs

  • Work-life balance
  • Flexible working hours
  • Mentorship opportunities
  • Career growth opportunities
