Taro Logo

Evals Software Engineer (Infrastructure)

Apollo Research focuses on AI safety research, particularly evaluating and auditing frontier AI models for deceptive alignment risks.
DevOps
Senior Software Engineer
In-Person
11 - 50 Employees
5+ years of experience
AI
This job posting is no longer active. 😔

Job Description

Apollo Research, a leading AI safety research organization, is seeking a Senior Software Engineer to lead their infrastructure initiatives. This role focuses on building and maintaining the secure infrastructure foundation that powers frontier AI evaluations. As the infrastructure Lead, you'll have significant autonomy in technical decision-making and will directly enable the company's research mission through robust, scalable systems.

The position involves designing and implementing infrastructure for running frontier LLM evaluations, choosing appropriate technologies, and building internal tools for job orchestration and results storage. You'll work closely with researchers to understand and prepare for future infrastructure needs, particularly for agent deployments. Security is a key focus, as you'll be responsible for AWS account administration and organization-wide security processes.

Apollo Research specializes in behavioral model evaluations and auditing real-world AI models, with a particular focus on deceptive alignment - where models appear aligned but may be capable of evading human oversight. You'll be joining a talented team of researchers and engineers, working from their London office shared with the London Initiative for Safe AI (LISA).

The role offers competitive compensation, comprehensive benefits including unlimited vacation and sick leave, daily meals, and professional development opportunities. The company supports international candidates with visa sponsorship and relocation assistance through AI Futures Grants. The interview process is practical and focused on relevant technical skills rather than theoretical exercises.

This is an excellent opportunity for an experienced infrastructure engineer who wants to contribute to important work in AI safety while leading and scaling critical technical infrastructure.

Last updated 3 months ago

Responsibilities For Evals Software Engineer (Infrastructure)

  • Design, implement, scale, and maintain infrastructure for running frontier LLM evals using Infrastructure as Code (IaC)
  • Choose and integrate appropriate technologies for infrastructure stack
  • Build internal software tools for job orchestration, project access, and results storage
  • Collaborate with researchers to understand future infrastructure needs
  • Ensure evals run properly and debug issues across the technology stack
  • Administer and secure internal AWS accounts
  • Help set up and manage organisation-wide security processes
  • Co-create and lead the infrastructure team

Requirements For Evals Software Engineer (Infrastructure)

Python
Kubernetes
  • Strong software engineering background, preferably in Python
  • Experience leading infrastructure projects from start to finish
  • Strong hands-on experience with Kubernetes
  • Solid knowledge of AWS, including IAM and EKS
  • Experience implementing security best practices for cloud and containerised environments
  • Experience with Infrastructure as Code tools (e.g. Terraform)

Benefits For Evals Software Engineer (Infrastructure)

Visa Sponsorship
Relocation Benefits
  • Competitive UK-based salary
  • Flexible work hours and schedule
  • Unlimited vacation
  • Unlimited sick leave
  • Lunch, dinner, and snacks provided on workdays
  • Paid work trips including staff retreats and conferences
  • $1,000 USD yearly professional development budget
  • Visa sponsorship available
  • Up to £10,000 relocation support through AI Futures Grants