Research Engineer, Frontier Red Team (RSP Evaluations)

Anthropic

Anthropic creates reliable, interpretable, and steerable AI systems, focusing on safe and beneficial AI development.

San Francisco, CA, USA • Seattle, WA, USA

$280,000 - $425,000

Machine Learning

Senior Software Engineer

Hybrid

501 - 1,000 Employees

5+ years of experience

AI · Cybersecurity

Description For Research Engineer, Frontier Red Team (RSP Evaluations)

Anthropic is seeking a Research Engineer for their Frontier Red Team to develop and implement critical safety evaluations for advanced AI systems. This role is central to Anthropic's Responsible Scaling Policy (RSP), focusing on ensuring the safe deployment of frontier AI models.

The position involves creating sophisticated evaluation systems to assess and control some of the most capable AI systems ever developed. You'll work across multiple crucial areas including biosecurity, autonomous replication, cybersecurity, and national security. Your responsibilities will include building, scaling, and running evaluations to measure potentially dangerous capabilities in models and determining when they cross ASL thresholds requiring enhanced security measures.

As a Research Engineer, you'll be at the forefront of AI safety, working in a hybrid environment between San Francisco or Seattle offices. The role offers competitive compensation ranging from $280,000 to $425,000 USD annually, along with comprehensive benefits including medical insurance, visa sponsorship, and parental leave.

The ideal candidate should have strong software engineering skills, particularly in Python, experience with distributed systems, and a background in conducting experiments with frontier AI models. You'll need to balance rapid prototyping with maintaining high engineering standards while working on unprecedented technical challenges.

This position represents a unique opportunity to influence AI safety standards across the industry while working with cutting-edge technology. The role requires a minimum of a Bachelor's degree or equivalent experience, with a hybrid work arrangement requiring at least 25% office presence. Anthropic values diversity and encourages applications from candidates of all backgrounds, emphasizing the importance of varied perspectives in addressing the social and ethical implications of AI development.

Last updated 3 months ago

Responsibilities For Research Engineer, Frontier Red Team (RSP Evaluations)

Design and implement robust evaluation infrastructure to measure model capabilities and risks across multiple domains
Lead technical projects to build and scale evaluation systems
Collaborate with domain experts to translate insights into concrete evaluation frameworks
Build sandboxed testing environments and automated pipelines for continuous model assessment
Work closely with researchers to rapidly prototype and iterate on new evaluation approaches
Partner with cross-functional teams to advance Anthropic's safety mission
Contribute to Capability Reports that inform critical deployment decisions

Requirements For Research Engineer, Frontier Red Team (RSP Evaluations)

Python

Have led and conducted fast, iterative experiments with frontier AI models
Have designed or implemented evaluations that involve sampling + prompting LLMs
Write clean, well-structured code that others can build upon
Have strong software engineering skills with extensive Python experience
Have experience working with distributed systems
Bachelor's degree in a related field or equivalent experience
Comfortable defining technical specifications and executing towards them
Self-starter who thrives in fast-paced, collaborative environments

Benefits For Research Engineer, Frontier Red Team (RSP Evaluations)

Medical Insurance

Visa Sponsorship

Parental Leave

Competitive compensation and benefits
Optional equity donation matching
Generous vacation and parental leave
Flexible working hours
Office space in San Francisco
Visa sponsorship available

Anthropic

Anthropic creates reliable, interpretable, and steerable AI systems, focusing on safe and beneficial AI development.

San Francisco, CA, USA • Seattle, WA, USA

$280,000 - $425,000

Machine Learning

Senior Software Engineer

Hybrid

501 - 1,000 Employees

5+ years of experience

AI · Cybersecurity

Research Engineer, Frontier Red Team (RSP Evaluations)

Anthropic

Description For Research Engineer, Frontier Red Team (RSP Evaluations)

Responsibilities For Research Engineer, Frontier Red Team (RSP Evaluations)

Requirements For Research Engineer, Frontier Red Team (RSP Evaluations)

Benefits For Research Engineer, Frontier Red Team (RSP Evaluations)

Anthropic

Jobs Related To Anthropic Research Engineer, Frontier Red Team (RSP Evaluations)