Research Engineer, Frontier Red Team (RSP Evaluations)

Anthropic creates reliable, interpretable, and steerable AI systems, focusing on safe and beneficial AI development.
$280,000 - $425,000
Machine Learning
Senior Software Engineer
Hybrid
501 - 1,000 Employees
5+ years of experience
AI · Cybersecurity

Description For Research Engineer, Frontier Red Team (RSP Evaluations)

Anthropic is seeking a Research Engineer for their Frontier Red Team to develop and implement critical safety evaluations for advanced AI systems. This role is central to Anthropic's Responsible Scaling Policy (RSP), focusing on ensuring the safe deployment of frontier AI models.

The position involves creating sophisticated evaluation systems to assess and control some of the most capable AI systems ever developed. You'll work across multiple crucial areas including biosecurity, autonomous replication, cybersecurity, and national security. Your responsibilities will include building, scaling, and running evaluations to measure potentially dangerous capabilities in models and determining when they cross ASL thresholds requiring enhanced security measures.

As a Research Engineer, you'll be at the forefront of AI safety, working in a hybrid environment between San Francisco or Seattle offices. The role offers competitive compensation ranging from $280,000 to $425,000 USD annually, along with comprehensive benefits including medical insurance, visa sponsorship, and parental leave.

The ideal candidate should have strong software engineering skills, particularly in Python, experience with distributed systems, and a background in conducting experiments with frontier AI models. You'll need to balance rapid prototyping with maintaining high engineering standards while working on unprecedented technical challenges.

This position represents a unique opportunity to influence AI safety standards across the industry while working with cutting-edge technology. The role requires a minimum of a Bachelor's degree or equivalent experience, with a hybrid work arrangement requiring at least 25% office presence. Anthropic values diversity and encourages applications from candidates of all backgrounds, emphasizing the importance of varied perspectives in addressing the social and ethical implications of AI development.

Last updated 4 days ago

Responsibilities For Research Engineer, Frontier Red Team (RSP Evaluations)

  • Design and implement robust evaluation infrastructure to measure model capabilities and risks across multiple domains
  • Lead technical projects to build and scale evaluation systems
  • Collaborate with domain experts to translate insights into concrete evaluation frameworks
  • Build sandboxed testing environments and automated pipelines for continuous model assessment
  • Work closely with researchers to rapidly prototype and iterate on new evaluation approaches
  • Partner with cross-functional teams to advance Anthropic's safety mission
  • Contribute to Capability Reports that inform critical deployment decisions

Requirements For Research Engineer, Frontier Red Team (RSP Evaluations)

Python
  • Have led and conducted fast, iterative experiments with frontier AI models
  • Have designed or implemented evaluations that involve sampling + prompting LLMs
  • Write clean, well-structured code that others can build upon
  • Have strong software engineering skills with extensive Python experience
  • Have experience working with distributed systems
  • Bachelor's degree in a related field or equivalent experience
  • Comfortable defining technical specifications and executing towards them
  • Self-starter who thrives in fast-paced, collaborative environments

Benefits For Research Engineer, Frontier Red Team (RSP Evaluations)

Medical Insurance
Visa Sponsorship
Parental Leave
  • Competitive compensation and benefits
  • Optional equity donation matching
  • Generous vacation and parental leave
  • Flexible working hours
  • Office space in San Francisco
  • Visa sponsorship available

Interested in this job?

Jobs Related To Anthropic Research Engineer, Frontier Red Team (RSP Evaluations)

Machine Learning Systems Engineer, RL Engineering

Senior ML Systems Engineer role at Anthropic focused on building and improving reinforcement learning systems for AI model training

Research Engineer, Knowledge Team

Senior Research Engineer position at Anthropic focused on redesigning how AI systems interact with external data sources through innovative information architectures and LLM training.

Software Engineer

Senior Software Engineering role at Anthropic focusing on building and optimizing large-scale ML systems, with emphasis on AI safety and interpretability.

Senior Software Engineer - Windows AI Agent

Senior Software Engineer position at Microsoft focusing on Windows AI Agent development, specializing in scalable model infrastructure and cloud-based AI workflows.

Machine Learning Engineer

Senior Machine Learning Engineer role at Adobe, developing innovative ML models and deploying AI solutions for the Digital Experience platform. Salary range: $120,700-$228,600.