Zyphra is seeking an HPC Systems Engineer to lead their machine learning infrastructure efforts. This role combines traditional systems engineering with cutting-edge AI/ML infrastructure management. The position involves maintaining and developing core infrastructure for machine learning research and production, managing Linux-based cluster environments, and ensuring smooth operation of critical systems. The ideal candidate will have strong Linux administration skills, experience with containerization and job scheduling, and the ability to work across various technical domains. This role offers the opportunity to directly impact developer productivity and ML training performance at a company working on advanced AI technologies. The position requires both technical expertise in systems engineering and strong collaborative abilities to work with various teams.