Evals Platform Engineer

Apollo Research focuses on behavioral model evaluations and AI safety, specifically addressing deceptive alignment in AI systems.
DevOps
Mid-Level Software Engineer
In-Person
11 - 50 Employees
3+ years of experience
AI

Description For Evals Platform Engineer

Apollo Research is at the forefront of AI safety, focusing on evaluating and auditing AI systems to prevent deceptive alignment - where models appear aligned but may actually evade human oversight. As their Evals Platform Engineer, you'll be responsible for building and maintaining the secure infrastructure that powers their frontier AI evaluations. This role combines DevOps expertise with AI safety, requiring strong skills in cloud infrastructure, security, and software engineering.

The position offers an opportunity to work with a dedicated team of researchers and engineers in London, alongside the London Initiative for Safe AI (LISA). You'll be instrumental in accelerating research by building robust backend infrastructure and researcher-facing interfaces, while maintaining high security standards for sensitive data handling.

The role demands expertise in AWS, Kubernetes, and Infrastructure as Code, with a focus on Python development. You'll have significant autonomy in technical decision-making while collaborating with a team that values truth-seeking and constructive feedback. The company offers comprehensive benefits including unlimited vacation, provided meals, and professional development opportunities.

This is an ideal position for someone passionate about AI safety who wants to contribute to crucial research through infrastructure development. The role combines technical challenges with meaningful impact in ensuring the safe development of AI systems. With visa sponsorship available and potential relocation support through AI Futures Grants, this position is accessible to international candidates looking to join London's growing AI safety community.

Last updated 4 days ago

Responsibilities For Evals Platform Engineer

  • Design, implement, scale, and maintain infrastructure for running frontier LLM evals using Infrastructure as Code (IaC)
  • Choose and integrate appropriate technologies for infrastructure stack
  • Collaborate with software engineers to build internal software tools
  • Administer and secure internal AWS accounts
  • Help set up and manage organisation-wide security processes

Requirements For Evals Platform Engineer

Python
Kubernetes
Linux
  • Experience leading infrastructure projects from start to finish
  • Experience implementing security best practices for cloud and containerised environments
  • Solid knowledge of AWS, including IAM and EKS
  • Strong hands-on experience with Kubernetes
  • Experience with Infrastructure as Code tools
  • Strong software engineering skills, preferably in Python

Benefits For Evals Platform Engineer

Visa Sponsorship
Education Budget
Relocation Benefits
  • Competitive UK-based salary
  • Flexible work hours and schedule
  • Unlimited vacation
  • Unlimited sick leave
  • Lunch, dinner, and snacks provided on workdays
  • Paid work trips including staff retreats and conferences
  • $1,000 USD yearly professional development budget
  • Visa sponsorship available
  • Up to £10,000 relocation cost reimbursement through AI Futures Grants

Interested in this job?

Jobs Related To Apollo Research Evals Platform Engineer

Sys Dev Engineer

Sys Dev Engineer role at Amazon Mesa team focusing on infrastructure, deployment, and tools development to support global digital content delivery systems.

Associate IT Infrastructure Engineer II

Associate IT Infrastructure Engineer II position at Collibra in Prague, focusing on Python development, Linux administration, and IT automation.

Quality and Automation Engineer 2

Quality and Automation Engineer position at Comcast focusing on software testing and automation.

System Infrastructure Developer

System Infrastructure Developer role at Apple, focusing on developing methodologies and automation for silicon development, offering $143K-$264K salary plus benefits.

CoreOS Quality Engineer (Private Cloud Compute - Server Operating Systems)

Quality Engineer position at Apple focusing on CoreOS and server operating systems testing, offering competitive salary range of $121,900-$214,500 in Cupertino.