Taro Logo

ML Infrastructure Engineer, Interpretability

Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole.
$280,000 - $625,000
Machine Learning
Senior Software Engineer
Hybrid
101 - 500 Employees
5+ years of experience
This job posting may no longer be active. You may be interested in these related jobs instead:
Research Engineer, Frontier Red Team (RSP Evaluations)

Senior Research Engineer position at Anthropic focusing on AI safety evaluations and implementing responsible scaling policies for frontier AI models.

Software Senior Engineer - AI Engineering

Senior Software Engineer position at Dell Technologies focusing on AI Engineering, developing cutting-edge AI applications and solutions within the Office of the CTO Dojo team.

AI ADK Software Engineer

Senior AI software engineering role focusing on embedded deep learning and neural network optimization for automotive applications at Qualcomm China.

Senior Software Engineer, Sous Chef

Senior Software Engineer position at Toast, focusing on AI/ML solutions for restaurant management, offering $130K-$210K salary, hybrid work, and comprehensive benefits.

Senior Prediction and Planning Machine Learning Engineer - Autonomous Vehicles

Senior ML Engineer role at NVIDIA focusing on prediction and planning systems for autonomous vehicles, combining AI expertise with automotive technology.

Description For ML Infrastructure Engineer, Interpretability

Anthropic is seeking an ML Infrastructure Engineer to join their Interpretability team. The role focuses on reverse engineering how trained models work, with the goal of making advanced AI systems safe through mechanistic understanding.

Key responsibilities include:

  • Implementing and analyzing research experiments at scale
  • Optimizing research workflows for efficiency and reliability
  • Building tools and abstractions to support rapid experimentation
  • Developing infrastructure to improve model safety across teams

The ideal candidate will have:

  • 5-10+ years of software engineering experience
  • Proficiency in Python and at least one other programming language
  • Strong prioritization and problem-solving skills
  • Interest in machine learning research and its applications
  • Concern for the societal impacts and ethics of their work

Additional valuable experience includes:

  • Designing flexible codebases for quick experimentation
  • Optimizing large-scale distributed systems
  • Collaborating with researchers and ML engineers
  • Experience with language modeling, transformers, GPUs, or PyTorch

This role offers the opportunity to work on cutting-edge AI interpretability research, collaborating across teams to improve the safety and understanding of large language models like Claude. The position is hybrid, requiring 25% in-office time, with locations in San Francisco, Seattle, or New York City.

Anthropic offers competitive compensation including salary, equity, and comprehensive benefits. They value diversity and encourage applicants from all backgrounds to apply, even if they don't meet every qualification.

Last updated 10 months ago

Responsibilities For ML Infrastructure Engineer, Interpretability

  • Implement and analyze research experiments, both quickly in toy scenarios and at scale in large models
  • Set up and optimize research workflows to run efficiently and reliably at large scale
  • Build tools and abstractions to support rapid pace of research experimentation
  • Develop and improve tools and infrastructure to support other teams in using Interpretability's work to improve model safety

Requirements For ML Infrastructure Engineer, Interpretability

Python
Rust
Go
Java
  • 5-10+ years of experience building software
  • Highly proficient in at least one programming language (e.g., Python, Rust, Go, Java) and productive with python
  • Strong ability to prioritize and direct effort toward the most impactful work
  • Comfortable operating with ambiguity and questioning assumptions
  • Want to learn more about machine learning research and its applications
  • Care about the societal impacts and ethics of your work

Benefits For ML Infrastructure Engineer, Interpretability

Medical Insurance
Dental Insurance
Vision Insurance
401k
Parental Leave
  • Comprehensive health, dental, and vision insurance
  • 401(k) plan with 4% matching
  • 22 weeks of paid parental leave
  • Unlimited PTO
  • Stipends for education, home office improvements, commuting, and wellness
  • Fertility benefits via Carrot
  • Daily lunches and snacks in office
  • Relocation support for those moving to the Bay Area

Interested in this job?