AWS Utility Computing (UC) is at the forefront of cloud innovation, providing foundational services like S3 and EC2. This role is specifically within the AWS AI organization, focusing on Amazon SageMaker, which aims to simplify deep learning workloads in the cloud. As customers increasingly adopt LLMs and Generative AI, the team is building a next-generation AI platform to accelerate development.
As an SDE 2 on the SageMaker team, you'll be instrumental in designing and developing distributed machine learning systems at scale. You'll work closely with ML scientists and customers to shape strategy and define roadmaps. The role involves building innovative solutions for large language model training, optimizing distributed training performance, and maintaining a fully-managed service for training foundation models.
The position offers unique opportunities to work with cutting-edge AI technologies, collaborate with leading technology companies, and contribute to open-source communities like PyTorch and NVIDIA/GPU. You'll be part of AWS's larger mission to democratize AI and machine learning capabilities for businesses worldwide.
The team culture emphasizes learning, curiosity, and inclusion, with various employee-led affinity groups and ongoing learning experiences. AWS values work-life harmony and provides strong mentorship and career growth opportunities. The role combines technical leadership with hands-on development, making it ideal for engineers passionate about AI/ML infrastructure and distributed systems.
Working at Amazon Web Services means joining the world's most comprehensive cloud platform provider, with opportunities to influence how businesses worldwide adopt and implement AI technologies. The role offers exposure to large-scale systems, cutting-edge AI infrastructure, and the chance to work with a global customer base.