At Amazon Elastic Kubernetes Service (EKS), we are building a core set of services that enable customers to create and use Kubernetes at massive scale. This role is part of the EKS Runtime team, focusing on making EKS the most reliable platform for running AI/ML workloads on Kubernetes clusters with 10,000+ nodes.
As an EKS Runtime Engineer, you'll work on:
- Optimizing Amazon EKS accelerated machine images (AMI) for AI/ML workloads
- Developing comprehensive test suites for Kubernetes workloads
- Designing and implementing CI/CD pipelines for functional testing, load testing, and security scanning of EKS GPU machine images
- Collaborating with Kubernetes experts, Product Managers, and Applied Scientists
- Contributing to critical path code and participating in peer code reviews
- Supporting on-call responsibilities and improving service SLOs
This is an opportunity to:
- Work at massive scale with cloud computing technologies
- Gain deep expertise in the Kubernetes data plane ecosystem
- Build next-generation container platforms
- Join an exceptional team pushing the boundaries of container technology
- Make significant impact on how customers run AI/ML workloads
The role offers competitive compensation including base salary, equity, sign-on payments, and comprehensive benefits including medical, financial, and other perks. You'll be part of AWS, a leader in cloud computing, working with cutting-edge technologies and solving complex engineering challenges.