Software Engineer, Trust & Safety

Anthropic

Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole.

London, UK

$265,000 - $370,000

Senior Software Engineer

Hybrid

3+ years of experience

This job posting may no longer be active. You may be interested in these related jobs instead:

Description For Software Engineer, Trust & Safety

Anthropic is seeking a Software Engineer for their Trust & Safety team to help build safety and oversight mechanisms for AI systems. This role focuses on developing systems to monitor models, prevent misuse, and ensure user well-being. Key responsibilities include building abuse detection infrastructure, creating monitoring systems for API partners, and implementing real-time safety mechanisms at scale.

The ideal candidate will have 3-10+ years of software engineering experience, preferably in integrity, spam, fraud, or abuse detection. Proficiency in SQL, Python, and data analysis tools is required, along with strong communication skills. A background in AI/ML systems, experience with machine learning frameworks, and familiarity with prompt engineering and adversarial inputs are considered strong assets.

Anthropic offers a competitive compensation package, including a salary range of £265,000 - £370,000 GBP, equity options, and comprehensive benefits. The company follows a hybrid work model, expecting staff to be in the office at least 25% of the time.

This role presents an opportunity to work on cutting-edge AI safety and ethics challenges, contributing to Anthropic's mission of creating reliable, interpretable, and steerable AI systems that are safe and beneficial for users and society. Join a collaborative team of researchers, engineers, and policy experts working to shape the future of AI technology.

Last updated a year ago

Responsibilities For Software Engineer, Trust & Safety

Develop monitoring systems to detect unwanted behaviors from our API partners and potentially take automated enforcement actions; surface these in internal dashboards to analysts for manual review
Build abuse detection mechanisms and infrastructure
Surface abuse patterns to our research teams to harden models at the training stage
Build robust and reliable multi-layered defenses for real-time improvement of safety mechanisms that work at scale
Analyze user reports of inappropriate content or accounts

Requirements For Software Engineer, Trust & Safety

Python

Bachelor's degree in Computer Science, Software Engineering or comparable experience
3-10+ years of experience in a software engineering position, preferably with a focus on integrity, spam, fraud, or abuse detection
Proficiency in SQL, Python, and data analysis tools
Strong communication skills and ability to explain complex technical concepts to non-technical stakeholders

Benefits For Software Engineer, Trust & Safety

Medical Insurance

Dental Insurance

Vision Insurance

401k

Parental Leave

Comprehensive health, dental, and vision insurance for you and all your dependents
Pension contribution (matching 4% of your salary)
21 weeks of paid parental leave
Unlimited PTO
Health cash plan
Life insurance and income protection
Daily lunches and snacks in our office