System Develpment Engineer, AGI - Infrastructure

A global technology company leading in e-commerce, cloud computing, AI, and digital streaming.
DevOps
Mid-Level Software Engineer
In-Person
5,000+ Employees
3+ years of experience
AI

Description For System Develpment Engineer, AGI - Infrastructure

The Artificial General Intelligence (AGI) team at Amazon is seeking talented engineers to work on industry-leading multi-modal and multi-lingual large language models (LLM). This role combines DevOps and infrastructure engineering, focusing on supporting and scaling Amazon's AGI infrastructure.

As a System Development Engineer, you'll be responsible for the critical infrastructure that powers Amazon's AGI initiatives. You'll work with cutting-edge technologies in machine learning infrastructure, managing and optimizing cluster operations, and developing automation tools for improved operational excellence.

The position offers the opportunity to work with advanced AI systems while building and maintaining the infrastructure that makes it all possible. You'll be part of a team that values both deep technical expertise and the ability to collaborate effectively across multiple functional areas.

Key responsibilities include managing LLM infrastructure, automating processes, developing operational tools, and ensuring the smooth running of complex distributed systems. You'll use various technologies including Kubernetes, AWS services, and multiple programming languages to build robust solutions.

The ideal candidate combines strong systems engineering background with software development skills, bringing experience in Linux/Unix environments, modern programming languages, and CI/CD practices. This role offers the chance to make significant contributions to Amazon's AGI initiatives while working with some of the most advanced AI infrastructure in the industry.

Amazon offers a collaborative environment where you can grow your career while working on challenging problems at scale. The company's "Work Hard. Have Fun. Make History" philosophy encourages innovation and personal growth, with opportunities to master your domain or expand your skillset across multiple areas.

Last updated 4 minutes ago

Responsibilities For System Develpment Engineer, AGI - Infrastructure

  • Provide support for cluster and node management for LLM infrastructure
  • Improve and automate cluster/capacity/maintenance upgrades
  • Develop automation tools for operational excellence
  • Work on operations and maintenance coding projects
  • Drive Company Wide Campaigns with Support and Engineering teams
  • Participate in design and code reviews
  • Troubleshoot and research root causes

Requirements For System Develpment Engineer, AGI - Infrastructure

Python
Ruby
Java
Linux
Kubernetes
  • 3+ years of administrative experience in networking, storage systems, and operating systems
  • Experience programming with modern languages (Python, Ruby, Golang, Java, C++, C#, Rust)
  • Experience with Linux/Unix
  • Experience with CI/CD pipelines build processes

Interested in this job?

Jobs Related To Amazon System Develpment Engineer, AGI - Infrastructure

Support Engineer III, SPFT

Support Engineer III role at Amazon SPFT team, building and maintaining critical financial systems with focus on automation and scalability.

System Dev Engineer, Amazon Robotics

System Development Engineer role at Amazon Robotics focusing on designing and implementing controls for advanced warehouse automation systems, combining software engineering with industrial robotics.

System Development Engineer, AFT - Platform Engineering & Services

Systems Development Engineer role at Amazon Fulfillment Technologies, focusing on platform engineering and operational excellence for warehouse technology services.

Server Engineer | Data Center Operations

Server Engineer position at Amazon Data Services Japan, focusing on data center infrastructure maintenance and hardware lifecycle management in AWS facilities.

System Develpment Engineer, AGI - Infrastructure

System Development Engineer position at Amazon's AGI team, focusing on LLM infrastructure management and automation using Kubernetes, AWS, and modern programming languages.