The Generative AI Innovation Center at AWS is at the forefront of empowering customers to leverage innovative AI technologies for transformative business opportunities. As a Senior Machine Learning Engineer, you'll be part of a multidisciplinary team working on cutting-edge LLM development and optimization.
Your role will involve designing and implementing distributed training pipelines for Large Language Models using advanced tools like Fully Sharded Data Parallel (FSDP) and DeepSpeed. You'll be responsible for adapting and fine-tuning LLMs for various applications, including new languages, domains, and vision applications, while also implementing Reinforcement Learning with Human Feedback (RLHF).
A key aspect of the role involves optimizing AI models for AWS's custom silicon (Inferentia and Trainium) using the AWS Neuron SDK and developing custom kernels for enhanced performance. You'll work directly with enterprise customers and foundational model providers to understand their challenges and co-develop tailored generative AI solutions.
AWS values diverse experiences and work-life harmony. The company offers comprehensive benefits, mentorship opportunities, and a strong focus on career growth. You'll be part of AWS Global Services, working alongside technical experts across dozens of countries to help customers achieve more with AWS cloud.
The role requires significant experience in software development, AI/ML technologies, and leadership capabilities. You'll be joining a team that's driving innovation in generative AI while working with top AWS clients to deliver next-generation AI solutions. The position offers competitive compensation and benefits, with opportunities to work on challenging problems at scale.