Research Engineer, Pre-training

Anthropic creates reliable, interpretable, and steerable AI systems, focusing on safe and beneficial AI development.
$315,000 - $340,000
Machine Learning
Staff Software Engineer
Hybrid
501 - 1,000 Employees
5+ years of experience
AI

Description For Research Engineer, Pre-training

Anthropic is seeking a Research Engineer to join their Pre-training team, focusing on developing the next generation of large language models. This role represents a unique opportunity to work at the intersection of cutting-edge AI research and practical engineering implementation.

The position involves working on critical aspects of AI development, including model architecture, algorithms, data processing, and optimizer development. As a Research Engineer, you'll be responsible for conducting research, implementing solutions, and leading small research projects while collaborating with team members on larger initiatives. The role requires expertise in Python and deep learning frameworks, with a preference for PyTorch experience.

The company offers a competitive salary range of $315,000 - $340,000 USD and provides comprehensive benefits including visa sponsorship, flexible working hours, and generous vacation time. The work environment is hybrid, requiring at least 25% in-office presence at one of their locations in San Francisco, Seattle, or New York City.

Anthropic stands out for its approach to AI research, viewing it as an empirical science similar to physics and biology. The company operates as a single cohesive team focused on large-scale research efforts, prioritizing impact and the development of steerable, trustworthy AI systems. Their research continues important work in areas such as GPT-3, Circuit-Based Interpretability, Multimodal Neurons, and AI Safety.

The ideal candidate will have an advanced degree in Computer Science, Machine Learning, or a related field, strong software engineering skills, and experience with large-scale machine learning systems. They should be comfortable balancing research goals with practical engineering constraints and have excellent communication skills for collaborative work.

Key projects include optimizing novel attention mechanisms, comparing compute efficiency of different Transformer variants, preparing large-scale datasets, scaling distributed training jobs, and creating interactive visualizations of model internals. The role offers an opportunity to contribute to the development of safe, ethical, and powerful artificial intelligence systems while working with a team committed to ensuring AI systems are aligned with human interests.

Anthropic values diversity and strongly encourages applications from candidates of all backgrounds, including those from underrepresented groups in tech. The company operates as a public benefit corporation and offers additional benefits such as equity donation matching and a collaborative office space for team interaction.

Last updated a day ago

Responsibilities For Research Engineer, Pre-training

  • Conduct research and implement solutions in model architecture, algorithms, data processing, and optimizer development
  • Independently lead small research projects while collaborating with team members on larger initiatives
  • Design, run, and analyze scientific experiments to advance understanding of large language models
  • Optimize and scale training infrastructure to improve efficiency and reliability
  • Develop and improve dev tooling to enhance team productivity
  • Contribute to the entire stack, from low-level optimizations to high-level model design

Requirements For Research Engineer, Pre-training

Python
Kubernetes
  • Advanced degree (MS or PhD) in Computer Science, Machine Learning, or related field
  • Strong software engineering skills with proven track record of building complex systems
  • Expertise in Python and experience with deep learning frameworks (PyTorch preferred)
  • Familiarity with large-scale machine learning, particularly language models
  • Ability to balance research goals with practical engineering constraints
  • Strong problem-solving skills and results-oriented mindset
  • Excellent communication skills and ability to work in collaborative environment
  • Care about the societal impacts of your work

Benefits For Research Engineer, Pre-training

Visa Sponsorship
  • Competitive compensation and benefits
  • Optional equity donation matching
  • Generous vacation and parental leave
  • Flexible working hours
  • Office space for collaboration

Interested in this job?

Jobs Related To Anthropic Research Engineer, Pre-training

Machine Learning Systems Engineer, Encodings and Tokenization

Machine Learning Systems Engineer role at Anthropic focusing on developing and optimizing encodings and tokenization systems for AI model training.

Machine Learning Systems Engineer, Model APIs

Machine Learning Systems Engineer role at Anthropic focused on building and maintaining Model Evaluations infrastructure and Research Inference APIs.

Research Engineer, Pre-training

Research Engineer position at Anthropic focusing on pre-training large language models, combining cutting-edge AI research with practical engineering to develop safe and trustworthy AI systems.

Staff Machine Learning Engineer - ML Algorithms

Staff Machine Learning Engineer position at EarnIn, focusing on developing and deploying advanced ML solutions and LLMs for fintech applications, offering competitive compensation and hybrid work arrangement.

Senior Staff Software Engineer, Experimentation Platform

Senior Staff Software Engineer role at DoorDash focusing on building and scaling the Experimentation Platform using ML, AI, and statistical methodologies.