The Generative AI Innovation Center at AWS is at the forefront of empowering customers to leverage cutting-edge AI technologies for transformative business opportunities. As a Machine Learning Engineer on our team, you'll be instrumental in developing custom Large Language Models (LLMs) across various domains and modalities. The role involves working with a multidisciplinary team of strategists, scientists, engineers, and architects to fine-tune and deploy customized generative AI applications at scale.
You'll be responsible for designing and implementing distributed training pipelines for LLMs, utilizing advanced tools like Fully Sharded Data Parallel (FSDP) and DeepSpeed. The position requires expertise in adapting LLMs for new languages and domains through continued pre-training, fine-tuning, and Reinforcement Learning with Human Feedback (RLHF). A key aspect of the role involves optimizing AI models for deployment on AWS's custom AI accelerators, including Inferentia and Trainium.
Working at AWS offers unique advantages, including the opportunity to pioneer cloud computing innovations and work with the world's most comprehensive cloud platform. The company values work-life harmony and provides a flexible working culture that supports both professional and personal growth. AWS's inclusive team culture promotes curiosity and connection, with employee-led affinity groups and inclusion events that foster stronger, more collaborative teams.
The position offers competitive compensation ranging from $129,300 to $223,600 per year, depending on location and experience. AWS provides comprehensive benefits including medical coverage, career development opportunities, and mentorship programs. You'll be part of a global team helping customers achieve more with AWS cloud technology while working on cutting-edge AI solutions.
This role is perfect for someone passionate about AI innovation, experienced in machine learning, and eager to work with enterprise customers to solve complex technical challenges. Join us to shape the future of AI technology while working with industry-leading cloud infrastructure.