Evals Platform Engineer

Apollo Research

Apollo Research focuses on behavioral model evaluations and AI safety, specifically addressing deceptive alignment in AI systems.

London, UK

DevOps

Mid-Level Software Engineer

In-Person

11 - 50 Employees

3+ years of experience

Description For Evals Platform Engineer

Apollo Research is at the forefront of AI safety, focusing on evaluating and auditing AI systems to prevent deceptive alignment, where models appear aligned but may actually evade human oversight. As their Evals Platform Engineer, you'll be responsible for building and maintaining the secure infrastructure that powers their frontier AI evaluations. This role combines DevOps expertise with AI safety, requiring strong skills in cloud infrastructure, security, and software engineering.

The position offers an opportunity to work with a dedicated team of researchers and engineers in London, alongside the London Initiative for Safe AI (LISA). You'll be instrumental in accelerating research by building robust backend infrastructure and researcher-facing interfaces, while maintaining high security standards for sensitive data handling.

The role demands expertise in AWS, Kubernetes, and Infrastructure as Code, with a focus on Python development. You'll have significant autonomy in technical decision-making while collaborating with a team that values truth-seeking and constructive feedback. The company offers comprehensive benefits including unlimited vacation, provided meals, and professional development opportunities.

This is an ideal position for someone passionate about AI safety who wants to contribute to crucial research through infrastructure development. The role combines technical challenges with meaningful impact in ensuring the safe development of AI systems. With visa sponsorship available and potential relocation support through AI Futures Grants, this position is accessible to international candidates looking to join London's growing AI safety community.

Last updated 4 days ago

Responsibilities For Evals Platform Engineer

Design, implement, scale, and maintain infrastructure for running frontier LLM evals using Infrastructure as Code (IaC)
Choose and integrate appropriate technologies for infrastructure stack
Collaborate with software engineers to build internal software tools
Administer and secure internal AWS accounts
Help set up and manage organisation-wide security processes

Requirements For Evals Platform Engineer

Python

Kubernetes

Linux

Experience leading infrastructure projects from start to finish
Experience implementing security best practices for cloud and containerised environments
Solid knowledge of AWS, including IAM and EKS
Strong hands-on experience with Kubernetes
Experience with Infrastructure as Code tools
Strong software engineering skills, preferably in Python

Benefits For Evals Platform Engineer

Visa Sponsorship

Education Budget

Relocation Benefits

Competitive UK-based salary
Flexible work hours and schedule
Unlimited vacation
Unlimited sick leave
Lunch, dinner, and snacks provided on workdays
Paid work trips including staff retreats and conferences
$1,000 USD yearly professional development budget
Visa sponsorship available
Up to £10,000 relocation cost reimbursement through AI Futures Grants
