Software Development Engineer, HPC/ML Interconnect Engineer, Annapurna Labs

Annapurna Labs, an integral part of AWS, develops hardware and software components that are critical building blocks for EC2 infrastructure.
$129,300 - $223,600
Distributed Systems
Senior Software Engineer
In-Person
5,000+ Employees
3+ years of experience
AI · Enterprise SaaS

Description For Software Development Engineer, HPC/ML Interconnect Engineer, Annapurna Labs

Annapurna Labs, a crucial division of AWS, is seeking an experienced Software Development Engineer to work on cutting-edge distributed AI/ML systems. This role is at the forefront of AI/ML development, focusing on collective operations that enable AI to scale across multiple accelerators and servers.

The position requires expertise in C/C++ programming and low-level system development, with a strong foundation in Linux, kernels, and performance optimization. Experience with embedded systems and high-speed networking or HPC interconnects is highly valued. The role offers an opportunity to work on features for the largest clusters, with the most significant customers, and the largest AI models.

As part of Annapurna Labs, you'll be working in a diverse, international environment alongside infrastructure experts, hardware engineers, RTL engineers, scientists, and architects. The team develops critical hardware and software components that serve as building blocks for EC2 infrastructure, with every EC2 instance running some type of hardware designed by Annapurna Labs.

The organization emphasizes mentorship, both receiving and providing guidance to team members. Work-life balance is a core tenet, with flexible working hours and a supportive environment. Career growth opportunities are abundant, with access to principal-level engineers and directors. The fast-paced nature of AI/ML development ensures continuous learning and professional development.

Amazon offers a comprehensive compensation package, including competitive base pay ranging from $129,300 to $223,600 depending on location, plus equity, sign-on payments, and extensive benefits. The company values diverse experiences and backgrounds, fostering an inclusive culture through employee-led affinity groups and ongoing learning experiences.

This role represents a unique opportunity to impact cloud computing at scale while working with cutting-edge technology in AI and machine learning. The ideal candidate will combine technical expertise with a passion for solving complex problems and a commitment to team collaboration and mentorship.

Last updated 4 minutes ago

Responsibilities For Software Development Engineer, HPC/ML Interconnect Engineer, Annapurna Labs

  • Work on distributed AI/ML systems
  • Develop collective operations for AI scaling across multiple accelerators & servers
  • Write performant C/C++ code
  • Collaborate with infrastructure experts, hardware engineers, RTL engineers, scientists & architects
  • Mentor new and junior engineers

Requirements For Software Development Engineer, HPC/ML Interconnect Engineer, Annapurna Labs

Linux
  • 3+ years of non-internship professional software development experience
  • 2+ years of non-internship design or architecture experience
  • Experience programming with at least one software programming language
  • Knowledge of Linux, kernels, and performant code
  • Experience with high-speed networking or HPC interconnects (preferred)

Benefits For Software Development Engineer, HPC/ML Interconnect Engineer, Annapurna Labs

Medical Insurance
  • Medical benefits
  • Work-life balance
  • Career growth opportunities
  • Mentorship program
  • International work environment

Interested in this job?

Jobs Related To Amazon Software Development Engineer, HPC/ML Interconnect Engineer, Annapurna Labs

AWS FSx Lustre - SDE III, AWS FSx Lustre

Senior Software Engineer role at Amazon Web Services building and operating high-performance distributed file systems for FSx Lustre service.

Sr. Software Development Engineer - Kuiper Network Services

Senior Software Development Engineer position at Amazon's Project Kuiper, building satellite network connectivity solutions using cloud infrastructure.

Software Development Engineer - Apache Flink Engineer, Amazon Managed Service for Apache Flink

Senior Software Engineer position at Amazon working on Apache Flink framework for stream processing, offering competitive compensation and comprehensive benefits.

Sr. Robotics Software Engineer, Amazon Robotics

Senior Robotics Software Engineer position at Amazon Robotics, focusing on building high-performance robotic systems and infrastructure in Berlin.

Senior Software Development Engineer - Amazon MSK, Managed Streaming Kafka

Senior Software Engineering role at Amazon Web Services working on MSK (Managed Streaming for Kafka) service, building scalable streaming data solutions and contributing to open source.