The Artificial General Intelligence (AGI) team at Amazon is seeking talented engineers to work on industry-leading multi-modal and multi-lingual large language models (LLM). This role combines DevOps and infrastructure engineering, focusing on supporting and scaling Amazon's AGI infrastructure.
As a System Development Engineer, you'll be responsible for the critical infrastructure that powers Amazon's AGI initiatives. You'll work with cutting-edge technologies in machine learning infrastructure, managing and optimizing cluster operations, and developing automation tools for improved operational excellence.
The position offers the opportunity to work with advanced AI systems while building and maintaining the infrastructure that makes it all possible. You'll be part of a team that values both deep technical expertise and the ability to collaborate effectively across multiple functional areas.
Key responsibilities include managing LLM infrastructure, automating processes, developing operational tools, and ensuring the smooth running of complex distributed systems. You'll use various technologies including Kubernetes, AWS services, and multiple programming languages to build robust solutions.
The ideal candidate combines strong systems engineering background with software development skills, bringing experience in Linux/Unix environments, modern programming languages, and CI/CD practices. This role offers the chance to make significant contributions to Amazon's AGI initiatives while working with some of the most advanced AI infrastructure in the industry.
Amazon offers a collaborative environment where you can grow your career while working on challenging problems at scale. The company's "Work Hard. Have Fun. Make History" philosophy encourages innovation and personal growth, with opportunities to master your domain or expand your skillset across multiple areas.