Anthropic is seeking an ML Infrastructure Engineer to join their Interpretability team. The role focuses on reverse engineering how trained models work, with the goal of making advanced AI systems safe through mechanistic understanding.
Key responsibilities include:
The ideal candidate will have:
Additional valuable experience includes:
This role offers the opportunity to work on cutting-edge AI interpretability research, collaborating across teams to improve the safety and understanding of large language models like Claude. The position is hybrid, requiring 25% in-office time, with locations in San Francisco, Seattle, or New York City.
Anthropic offers competitive compensation including salary, equity, and comprehensive benefits. They value diversity and encourage applicants from all backgrounds to apply, even if they don't meet every qualification.