Software Development Engineer, HPC/ML Interconnect Engineer, Annapurna Labs

Annapurna Labs, an integral part of AWS, develops hardware and software components that are critical building blocks for EC2 infrastructure.
$129,300 - $223,600
Distributed Systems
Senior Software Engineer
In-Person
5,000+ Employees
3+ years of experience
AI · Enterprise SaaS

Description For Software Development Engineer, HPC/ML Interconnect Engineer, Annapurna Labs

Join Annapurna Labs, a crucial part of AWS, as a Senior Software Development Engineer focusing on distributed AI/ML systems. This role puts you at the forefront of AI/ML development, working on features for the largest clusters and AI models. You'll be developing collective operations that enable AI to scale across multiple accelerators and servers, primarily using C/C++ in a low-level environment.

The position requires strong expertise in Linux, kernels, and performance optimization. Your work will directly impact AWS's EC2 infrastructure, as every instance runs on hardware designed by Annapurna Labs. You'll collaborate with a diverse, international team of infrastructure experts, hardware engineers, RTL engineers, scientists, and architects.

The role offers significant growth opportunities, working alongside principal-level engineers and directors. The team values mentorship and knowledge-sharing and maintains a strong work-life balance. You'll be part of a fast-paced environment focused on the latest AI/ML advancements while enjoying flexible working hours and a supportive team culture.

Key responsibilities include developing networking solutions for Machine Learning and High-Performance Computing workloads, mentoring junior engineers, and contributing to the full software development lifecycle. The ideal candidate brings 3+ years of software development experience and strong system architecture skills, preferably along with experience in embedded systems and high-speed networking.

This is an excellent opportunity for someone passionate about low-level systems programming, distributed computing, and AI/ML infrastructure. You'll be working on cutting-edge technology that powers some of the world's largest AI workloads while being part of a team that values continuous learning and professional development.

Last updated 9 hours ago

Responsibilities For Software Development Engineer, HPC/ML Interconnect Engineer, Annapurna Labs

  • Work on distributed AI/ML systems
  • Develop collective operations for AI scaling across multiple accelerators & servers
  • Build networking solutions for Machine Learning and High-Performance Computing workloads
  • Mentor new and junior engineers
  • Collaborate with infrastructure experts, hardware engineers, RTL engineers, scientists & architects

Requirements For Software Development Engineer, HPC/ML Interconnect Engineer, Annapurna Labs

  • 3+ years of non-internship professional software development experience
  • 2+ years of non-internship design or architecture experience
  • Programming experience in at least one programming language
  • Knowledge of Linux, kernels, and performant code
  • Experience with embedded systems (preferred)
  • Experience with high-speed networking or HPC interconnects (highly valued)

Benefits For Software Development Engineer, HPC/ML Interconnect Engineer, Annapurna Labs

  • Medical benefits
  • Work-life balance
  • Career growth opportunities
  • Mentorship programs
  • Flexible working hours
