The Generative AI Innovation Center at AWS is at the forefront of empowering customers to leverage cutting-edge AI technologies for transformative business opportunities. As a Machine Learning Engineer, you'll join a multidisciplinary team of strategists, scientists, engineers, and architects who collaborate with customers across industries to fine-tune and deploy customized generative AI applications at scale.
Your role will focus on driving the development of custom Large Language Models (LLMs) across various languages, domains, and modalities. You'll be responsible for implementing distributed training pipelines using advanced tools like Fully Sharded Data Parallel (FSDP) and DeepSpeed, ensuring scalability and efficiency at massive scale. The position involves adapting and fine-tuning state-of-the-art LLMs through continued pre-training and Reinforcement Learning with Human Feedback (RLHF).
Working closely with AWS's custom AI accelerators, you'll optimize models for deployment on AWS Inferentia and Trainium, leveraging the AWS Neuron SDK and developing custom kernels for enhanced performance. This role offers a unique opportunity to collaborate directly with enterprise customers and foundational model providers, understanding their technical challenges and co-developing tailored generative AI solutions.
The position combines technical expertise with customer interaction, requiring both strong programming skills and the ability to communicate effectively with stakeholders. You'll work in a startup-like environment where you're always focused on high-impact projects, participating in design discussions, code reviews, and cross-functional collaboration to drive business solutions.
AWS offers comprehensive benefits including medical coverage, financial benefits, and work-life harmony. The company values diverse experiences and fosters an inclusive team culture through employee-led affinity groups and ongoing learning experiences. Career growth is supported through mentorship and knowledge-sharing opportunities, making this an ideal role for someone looking to advance their career in AI while working on cutting-edge technology at scale.