Sys Dev Engineer - Infrastructure, Annapurna Labs

Annapurna Labs is an AWS organization building innovation in silicon and software for AWS customers, with development centers in the U.S. and Israel.
DevOps
Mid-Level Software Engineer
In-Person
3+ years of experience
AI · Enterprise SaaS · Hardware

Description For Sys Dev Engineer - Infrastructure, Annapurna Labs

Annapurna Labs, an innovative organization within AWS, is seeking a Systems Development Engineer to join their Infrastructure team. This role offers an exciting opportunity to work on next-generation cloud computing infrastructure, specifically focusing on Machine Learning Accelerators like AWS Inferentia2 and Trainium. The position combines software development with infrastructure management at massive scale, working alongside some of the industry's brightest minds in a fast-paced, startup-like environment.

The role involves building and maintaining critical infrastructure used by Annapurna engineers to design and develop hardware and software components. You'll be working on-site in Austin, Texas, contributing to the development of custom silicon and owning the infrastructure that enables this innovation. The position requires expertise in both software development and systems engineering, with a focus on cloud infrastructure, networking, and automation.

As a Systems Development Engineer, you'll be responsible for architecting and delivering solutions directly to internal customers, collaborating with multiple teams, and implementing large-scale infrastructure improvements. The work involves everything from networking and high-performance compute clusters to infrastructure automation of hardware/software/firmware testing and ASIC/EDA development.

This is an excellent opportunity for someone who wants to make a significant impact in cloud computing infrastructure while working with cutting-edge technology in machine learning acceleration. The role offers hands-on experience with AWS services and the chance to influence technical implementations across multiple teams. If you're passionate about building complete products from inception to customer delivery and want to be part of creating the world's most advanced Machine Learning Accelerators, this position at Annapurna Labs could be your next career move.

Last updated 8 minutes ago

Responsibilities For Sys Dev Engineer - Infrastructure, Annapurna Labs

  • Build Cloud-Scale Machine Learning Acceleration Infrastructure
  • Develop and execute infrastructure development plans
  • Solve critical infrastructure issues involving networking and compute clusters
  • Execute and scale next generation cloud infrastructure
  • Own design reviews for infrastructure development
  • Implement process improvements for team's agility and operations
  • Define new mechanisms for system health monitoring and automation
  • Develop and update operational runbooks
  • Participate in on-call rotations

Requirements For Sys Dev Engineer - Infrastructure, Annapurna Labs

Python
Go
Java
Linux
  • 2+ years of non-internship professional software development experience
  • 1+ years of designing or architecting new and existing systems experience
  • 3+ years of administrative experience in networking, storage systems, operating systems
  • Knowledge of systems engineering fundamentals
  • Experience programming with at least one modern language
  • Bachelor's degree in computer science or equivalent

Interested in this job?

Jobs Related To Annapurna Labs (U.S.) Inc. Sys Dev Engineer - Infrastructure, Annapurna Labs

ADC Engineer, Keya

AWS ADC Engineer position for the Keya team, supporting cloud services for US Government, requiring TS/SCI clearance and strong Linux/DevOps experience.

Technical Operations Engineer

Technical Operations Engineer role at Oracle Cloud Infrastructure focusing on managing and supporting production environments for government and sovereign cloud regions.

Systems Development Engineer, Edge Infrastructure Operations

Systems Development Engineer role at Google focusing on Edge Infrastructure Operations, managing and automating global infrastructure support for content delivery networks.

Production Systems Engineer, Cooling & Power

Production Systems Engineer role at Meta focusing on liquid cooling and power systems for AI infrastructure, combining hardware and software expertise with competitive compensation.

Technical Operations Engineer

Technical Operations Engineer position at Oracle in Japan, focusing on production environment management and cloud infrastructure support with 3-5+ years experience required.