Software Engineer, Safeguards (London)

Anthropic creates reliable, interpretable, and steerable AI systems, focusing on safe and beneficial AI development through research, engineering, and policy work.
$303,000 - $411,000
Security
Senior Software Engineer
Hybrid
501 - 1,000 Employees
3+ years of experience
AI

Description For Software Engineer, Safeguards (London)

Anthropic is seeking a Software Engineer for their Safeguards team in London, focusing on building safety and oversight mechanisms for AI systems. This role combines technical expertise with trust and safety to ensure AI systems remain beneficial and secure. The position involves developing monitoring systems, abuse detection, and safety mechanisms at scale.

The ideal candidate will have 3-8+ years of software engineering experience, particularly in integrity and abuse detection, with strong skills in Python and SQL. Experience with AI/ML systems, fraud detection, and security monitoring is highly valued. The role offers competitive compensation (£240,000 - £325,000) and benefits including flexible work arrangements and visa sponsorship.

Anthropic's mission centers on creating reliable, interpretable AI systems that benefit society. The company operates as a public benefit corporation, emphasizing collaborative research and empirical science approaches. They value diverse perspectives and encourage applications from all qualified candidates, particularly those from underrepresented groups.

Working at Anthropic means joining a cohesive team focused on large-scale research efforts, with an emphasis on impact and advancing goals of steerable, trustworthy AI. The hybrid work environment requires 25% office presence, fostering collaboration while maintaining flexibility. The company offers comprehensive benefits, including equity donation matching and generous leave policies, creating an attractive environment for those passionate about safe AI development.

Last updated 12 days ago

Responsibilities For Software Engineer, Safeguards (London)

  • Develop monitoring systems to detect unwanted behaviors from API partners
  • Build abuse detection mechanisms and infrastructure
  • Surface abuse patterns to research teams to harden models at training stage
  • Build robust multi-layered defenses for real-time safety mechanisms at scale
  • Analyze user reports of inappropriate content or accounts

Requirements For Software Engineer, Safeguards (London)

Python
  • Bachelor's degree in Computer Science, Software Engineering or comparable experience
  • 3-8+ years of software engineering experience, preferably in integrity, spam, fraud, or abuse detection
  • Proficiency in SQL, Python, and data analysis tools
  • Strong communication skills and ability to explain complex technical concepts

Benefits For Software Engineer, Safeguards (London)

Visa Sponsorship
  • Competitive compensation and benefits
  • Optional equity donation matching
  • Generous vacation and parental leave
  • Flexible working hours
  • Office space for collaboration

Interested in this job?

Jobs Related To Anthropic Software Engineer, Safeguards (London)

Application Security Engineer

Senior Application Security Engineer role at Anthropic, focusing on securing AI systems and implementing security best practices in the software development lifecycle.

Senior Software Security Engineer

Senior Software Security Engineer role at Anthropic focusing on building security for AI systems and infrastructure

Senior Software Security Engineer

Senior Software Security Engineer role at Anthropic focused on building security for AI systems and infrastructure

Application Security Engineer

Senior Application Security Engineer role at Anthropic, focusing on securing AI systems and development processes, offering $300-320K salary with hybrid work options in SF, Seattle, or NYC.

Senior Software Security Engineer

Senior Software Security Engineer role at Anthropic in London, focusing on securing AI systems and infrastructure, offering £240,000-£255,000 with hybrid work model.