Sr. Software Development Engineer, HPC/ML Networking Engineer, Annapurna Labs

Annapurna Labs is an integral part of AWS that develops hardware and software components for EC2 infrastructure, specializing in designing software, systems, and chips that optimize the AWS customer experience.
$151,300 - $261,500
Senior Software Engineer
In-Person
5+ years of experience
AI · Enterprise SaaS

Job Description

We are seeking an experienced Senior Software Development Engineer to join Annapurna Labs, a crucial part of AWS that develops hardware and software components for EC2 infrastructure. This role focuses on distributed AI/ML systems, specifically working on collective operations that enable AI to scale across multiple accelerators & servers.

The position involves working with C/C++ in a low-level environment, requiring solid knowledge of Linux, kernels, and performance optimization. Experience with embedded systems and high-speed networking or HPC interconnects is highly valued. This role puts you at the forefront of AI/ML development, working on features for the largest clusters and AI models.

As part of Annapurna Labs, you'll be working in a diverse, international team alongside infrastructure experts, hardware engineers, RTL engineers, scientists, and architects. The organization values mentorship, whether you are receiving guidance or providing it to team members. You'll have the opportunity to work with numerous principal-level engineers and directors, with clear paths for career growth.

The team maintains a fast-paced environment while prioritizing work-life balance as a core tenet. You'll be encouraged to continuously learn and grow, as the AI/ML field is rapidly evolving. The position offers competitive compensation ranging from $151,300 to $261,500 per year, depending on location and experience, plus comprehensive benefits.

Key responsibilities include developing distributed AI/ML systems, optimizing collective operations, writing performant code, collaborating across disciplines, and mentoring junior engineers. The ideal candidate will have 5+ years of professional software development experience, strong programming skills, and experience leading technical designs and architecture.

Join us if you're passionate about solving complex problems, want to work with cutting-edge AI/ML technology, and desire to make a significant impact on AWS's infrastructure. This role offers the unique opportunity to work on technology that powers some of the world's largest and most sophisticated AI/ML workloads while being part of a supportive, diverse, and innovative team.

Last updated 6 days ago

Responsibilities For Sr. Software Development Engineer, HPC/ML Networking Engineer, Annapurna Labs

  • Work on distributed AI/ML systems
  • Develop collective operations for AI scaling across multiple accelerators & servers
  • Write performant C/C++ code
  • Collaborate with infrastructure experts, hardware engineers, RTL engineers, scientists & architects
  • Mentor new and junior engineers
  • Work on features for large clusters and AI models

Requirements For Sr. Software Development Engineer, HPC/ML Networking Engineer, Annapurna Labs

  • 5+ years of non-internship professional software development experience
  • 5+ years of programming experience with at least one software programming language
  • 5+ years of experience leading design or architecture
  • 5+ years of full software development life cycle experience
  • Experience as a mentor, tech lead or leading an engineering team
  • Solid knowledge of Linux, kernels, and performance optimization
  • Experience with embedded systems and high-speed networking or HPC interconnects preferred

Benefits For Sr. Software Development Engineer, HPC/ML Networking Engineer, Annapurna Labs

  • Medical Insurance
  • 401k