System Development Engineer, AGI - Modeling Services

A global technology company leading in e-commerce, cloud computing, and artificial intelligence
DevOps
Mid-Level Software Engineer
In-Person
3+ years of experience
AI

Description For System Development Engineer, AGI - Modeling Services

The Artificial General Intelligence (AGI) team at Amazon is seeking passionate and talented engineers to contribute to the development and maintenance of industry-leading multi-modal and multi-lingual large language models (LLMs). This role focuses on supporting and enhancing the infrastructure that powers these AI systems. You'll work with Amazon's hyper-scalable, general-purpose large model training and inference systems to develop and deploy state-of-the-art sensory AI foundational models.

The position combines DevOps expertise with AI infrastructure management, requiring strong skills in automation, system administration, and modern programming languages. You'll be responsible for ensuring the smooth operation of LLM infrastructure, implementing automation solutions, and driving operational excellence.

The role offers diverse opportunities for growth and specialization. Whether you excel in deep technical mastery, multi-tasking under pressure, process improvement, or focused coding, there's a place for your talents. You'll work alongside peers and senior leaders to define and improve operational standards across systems.

The ideal candidate will have experience with distributed systems, strong programming skills in languages like Python, Ruby, or Java, and expertise in AWS services and Kubernetes. This is an opportunity to be part of Amazon's AGI team while living up to the company's principle of "Work Hard. Have Fun. Make History."

The position offers the chance to work on cutting-edge AI technology while developing expertise in large-scale distributed systems and cloud infrastructure. You'll be part of a team that's pushing the boundaries of what's possible in artificial intelligence while maintaining robust and scalable systems.

Last updated 4 months ago

Responsibilities For System Development Engineer, AGI - Modeling Services

  • Provide support for cluster and node management of LLM infrastructure
  • Improve and automate cluster/capacity/maintenance upgrades
  • Develop automation tools for operational excellence
  • Work on operations and maintenance coding projects
  • Participate in design and code reviews
  • Troubleshoot and research root causes
  • Drive company-wide campaigns with Support and Engineering teams

Requirements For System Development Engineer, AGI - Modeling Services

Skills: Python, Java, Ruby, Linux, Kubernetes

  • 3+ years of administrative experience in networking, storage systems, and operating systems
  • Experience programming with modern languages (Python, Ruby, Golang, Java, C++, C#, Rust)
  • Experience with Linux/Unix
  • Experience with CI/CD pipelines and build processes
  • Experience with distributed systems at scale
