Software Engineer - Generative AI, AGIF | Runtime Services

Global technology company leading in e-commerce, cloud computing, AI, and digital streaming.
$129,300 - $223,600
Machine Learning
Mid-Level Software Engineer
In-Person
5,000+ Employees
3+ years of experience
AI · Enterprise SaaS

Description For Software Engineer - Generative AI, AGIF | Runtime Services

Join Amazon's cutting-edge Generative AI team to advance state-of-the-art technology that will benefit all Amazon businesses and customers. As a Software Development Engineer, you'll work with a talented team of engineers and scientists in a highly collaborative environment. The role focuses on building best-in-class, fast, accurate, and cost-efficient large language model inference solutions and infrastructure.

Your responsibilities will include designing and implementing high-performance inference capabilities, working with multi-modality and SOTA model architectures, and optimizing for latency, throughput, and cost. You'll have the opportunity to read research papers, experiment with new algorithms, and implement production-grade solutions. The position requires close collaboration with scientists and other engineering teams.

The role offers competitive compensation ranging from $129,300 to $223,600 based on location, plus equity and other benefits. You'll be part of Amazon's mission to push the boundaries of Generative AI technology while working with some of the most advanced AI systems. This is an excellent opportunity for someone passionate about AI/ML who wants to make a significant impact at scale.

The ideal candidate should have strong software development experience, knowledge of machine learning and deep learning, and a track record of performance optimization. Experience with Large Language Model inference, GPU programming, and Python/C++ development is highly valued. You'll be expected to maintain high standards in operational excellence and continuously innovate to improve system efficiency.

Last updated 7 hours ago

Responsibilities For Software Engineer - Generative AI, AGIF | Runtime Services

  • Design, develop, test, and deploy high performance inference capabilities
  • Collaborate with engineers and scientists to influence overall strategy
  • Define team's roadmap
  • Drive system architecture
  • Spearhead best practices
  • Mentor junior engineers
  • Implement production grade solutions
  • Support production systems

Requirements For Software Engineer - Generative AI, AGIF | Runtime Services

Python
  • 3+ years of non-internship professional software development experience
  • 2+ years of design or architecture experience
  • Experience programming with at least one software programming language
  • Knowledge of Machine Learning and Deep Learning
  • Experience with software performance optimization

Benefits For Software Engineer - Generative AI, AGIF | Runtime Services

Medical Insurance
Equity
  • Medical benefits
  • Financial benefits
  • Equity compensation available
  • Sign-on payments available

Interested in this job?

Jobs Related To Amazon Software Engineer - Generative AI, AGIF | Runtime Services

Systems Engineer, AI/ML

Systems Engineer position at AWS focusing on AI/ML services, combining cloud infrastructure expertise with artificial intelligence systems support.

Software Engineer- AI/ML, AWS Neuron

Software Engineer position for AWS Neuron team working on AI/ML infrastructure and distributed training solutions.

Software Engineer- AI/ML, AWS Neuron Distributed Training

Senior Software Engineer position at AWS Neuron focusing on distributed training solutions for machine learning, working with cutting-edge ML accelerators and frameworks.

Software Development Engineer, Ring AI

Software Development Engineer position at Ring AI (Amazon) in Iasi, Romania, focusing on computer vision and machine learning infrastructure for smart home security solutions.

Systems Development Engineer, AI/ML

Systems Development Engineer position at AWS focusing on AI/ML services, involving cloud infrastructure automation, system operations, and development of large-scale distributed systems.