Taro Logo

Software Engineer, Trust & Safety

Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole.
$265,000 - $370,000
Senior Software Engineer
Hybrid
3+ years of experience
This job posting may no longer be active.

Description For Software Engineer, Trust & Safety

Anthropic is seeking a Software Engineer for their Trust & Safety team to help build safety and oversight mechanisms for AI systems. This role focuses on developing systems to monitor models, prevent misuse, and ensure user well-being. Key responsibilities include building abuse detection infrastructure, creating monitoring systems for API partners, and implementing real-time safety mechanisms at scale.

The ideal candidate will have 3-10+ years of software engineering experience, preferably in integrity, spam, fraud, or abuse detection. Proficiency in SQL, Python, and data analysis tools is required, along with strong communication skills. A background in AI/ML systems, experience with machine learning frameworks, and familiarity with prompt engineering and adversarial inputs are considered strong assets.

Anthropic offers a competitive compensation package, including a salary range of £265,000 - £370,000 GBP, equity options, and comprehensive benefits. The company follows a hybrid work model, expecting staff to be in the office at least 25% of the time.

This role presents an opportunity to work on cutting-edge AI safety and ethics challenges, contributing to Anthropic's mission of creating reliable, interpretable, and steerable AI systems that are safe and beneficial for users and society. Join a collaborative team of researchers, engineers, and policy experts working to shape the future of AI technology.

Last updated 10 months ago

Responsibilities For Software Engineer, Trust & Safety

  • Develop monitoring systems to detect unwanted behaviors from our API partners and potentially take automated enforcement actions; surface these in internal dashboards to analysts for manual review
  • Build abuse detection mechanisms and infrastructure
  • Surface abuse patterns to our research teams to harden models at the training stage
  • Build robust and reliable multi-layered defenses for real-time improvement of safety mechanisms that work at scale
  • Analyze user reports of inappropriate content or accounts

Requirements For Software Engineer, Trust & Safety

Python
  • Bachelor's degree in Computer Science, Software Engineering or comparable experience
  • 3-10+ years of experience in a software engineering position, preferably with a focus on integrity, spam, fraud, or abuse detection
  • Proficiency in SQL, Python, and data analysis tools
  • Strong communication skills and ability to explain complex technical concepts to non-technical stakeholders

Benefits For Software Engineer, Trust & Safety

Medical Insurance
Dental Insurance
Vision Insurance
401k
Parental Leave
  • Comprehensive health, dental, and vision insurance for you and all your dependents
  • Pension contribution (matching 4% of your salary)
  • 21 weeks of paid parental leave
  • Unlimited PTO
  • Health cash plan
  • Life insurance and income protection
  • Daily lunches and snacks in our office

Interested in this job?