Research Engineer / Research Scientist, Finetuning

Anthropic

Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole.

San Francisco Bay Area, CA, USA • New York, NY, USA • Seattle, WA, USA

$280,000 - $625,000

Machine Learning

Senior Software Engineer

Hybrid

101 - 500 Employees

5+ years of experience

This job posting may no longer be active. You may be interested in these related jobs instead:

Description For Research Engineer / Research Scientist, Finetuning

Anthropic is seeking a Research Engineer / Research Scientist for Finetuning to help construct and rapidly iterate on machine learning experiments to improve the behavior of powerful AI systems. This role involves working on cutting-edge research to make AI helpful, honest, and harmless through techniques like constitutional AI.

Key responsibilities include:

Developing novel finetuning techniques to improve language model behavior
Testing constitutional AI techniques at scale and measuring impacts
Building tooling and infrastructure for efficient fine-tuning experiments
Developing prompts and strategies to improve and test model behaviors
Running experiments that feed into key AI research and safety efforts

The ideal candidate will have:

Significant Python, machine learning, research engineering, or research experience
A preference for fast-moving collaborative projects with concrete goals
A results-oriented approach with flexibility and focus on impact
Willingness to take on tasks outside their job description
Care about the impact of AI and their work

Strong candidates may also have:

Prior experience with large language model finetuning (e.g. RLHF)
Experience with complex codebases and RL infrastructure
Experience authoring ML/NLP/AI alignment research papers or similar industry work

This role offers the opportunity to do creative, cutting-edge research on frontier models and see concrete improvements in AI performance and safety. The position can be more research or engineering focused depending on the candidate's background.

Anthropic offers competitive compensation including salary, equity, and benefits. They have a hybrid work policy requiring 25% in-office time, with a preference for Bay Area candidates but openness to other locations. The company culture emphasizes collaboration, impact, and advancing the long-term goals of creating steerable and trustworthy AI systems.

Last updated a year ago

Responsibilities For Research Engineer / Research Scientist, Finetuning

Develop novel finetuning techniques to improve language model behavior
Test constitutional AI techniques at scale and measure impacts
Build tooling and infrastructure for efficient fine-tuning experiments
Develop prompts and strategies to improve and test model behaviors
Run experiments that feed into key AI research and safety efforts

Requirements For Research Engineer / Research Scientist, Finetuning

Python

Significant Python, machine learning, research engineering, or research experience
Preference for fast-moving collaborative projects with concrete goals
Results-oriented approach with flexibility and focus on impact
Willingness to take on tasks outside job description
Care about the impact of AI and their work

Benefits For Research Engineer / Research Scientist, Finetuning

Equity

Medical Insurance

Dental Insurance

Vision Insurance

401k

Education Budget

Parental Leave

Equity
Health insurance
Dental insurance
Vision insurance
401(k) with 4% matching
22 weeks paid parental leave
Unlimited PTO
Education stipend
Home office improvement stipend
Commuting stipend
Wellness stipend
Fertility benefits
Daily lunches and snacks in office
Relocation support