Machine Learning Systems Engineer, Encodings and Tokenization

Anthropic creates reliable, interpretable, and steerable AI systems, focusing on safe and beneficial AI development for users and society.
$300,000 - $405,000
Machine Learning
Staff Software Engineer
Hybrid
501 - 1,000 Employees
8+ years of experience
AI

Description For Machine Learning Systems Engineer, Encodings and Tokenization

Anthropic is seeking an experienced Machine Learning Systems Engineer to join their Encodings and Tokenization team. This role represents a unique opportunity to work at the intersection of machine learning infrastructure and research, focusing on developing critical systems that power AI model training.

The position involves building and optimizing tokenization systems that are fundamental to Anthropic's AI research progress. As a bridge between Pretraining and Finetuning teams, you'll be responsible for creating infrastructure that enables more efficient and effective training of AI systems while maintaining their reliability and interpretability.

The ideal candidate brings 8+ years of software engineering experience with strong machine learning expertise. You'll work in a collaborative environment that values pair programming and emphasizes the societal impact of AI development. The role requires proficiency in Python, experience with ML infrastructure, and the ability to work effectively in a research-driven environment.

Anthropic offers a competitive compensation package ranging from $300,000 to $405,000 USD annually, along with comprehensive benefits including equity donation matching, generous leave policies, and flexible working arrangements. The position follows a hybrid work model requiring at least 25% in-office presence in either San Francisco or New York City.

The company stands out for its approach to AI research, treating it as an empirical science similar to physics and biology. They focus on large-scale research efforts aimed at developing steerable, trustworthy AI systems. Their work builds on significant research achievements including GPT-3, Circuit-Based Interpretability, and Learning from Human Preferences.

This role offers the opportunity to work on cutting-edge AI technology while contributing to Anthropic's mission of ensuring AI systems are safe and beneficial for society. The position combines technical challenges with meaningful impact, making it ideal for engineers who care about both technical excellence and ethical AI development.

Last updated 2 days ago

Responsibilities For Machine Learning Systems Engineer, Encodings and Tokenization

  • Design, develop, and maintain tokenization systems used across Pretraining and Finetuning workflows
  • Optimize encoding techniques to improve model training efficiency and performance
  • Collaborate closely with research teams to understand their evolving needs around data representation
  • Build infrastructure that enables researchers to experiment with novel tokenization approaches
  • Implement systems for monitoring and debugging tokenization-related issues
  • Create robust testing frameworks to validate tokenization systems
  • Identify and address bottlenecks in data processing pipelines
  • Document systems thoroughly and communicate technical decisions

Requirements For Machine Learning Systems Engineer, Encodings and Tokenization

Python
  • 8+ years of software engineering experience
  • Significant software engineering experience with demonstrated machine learning expertise
  • Comfortable navigating ambiguity in rapidly evolving research environments
  • Strong collaboration skills with cross-functional teams
  • Proficient in Python and familiar with modern ML development practices
  • Strong analytical skills
  • Experience with machine learning systems, data pipelines, or ML infrastructure

Benefits For Machine Learning Systems Engineer, Encodings and Tokenization

Visa Sponsorship
  • Competitive compensation and benefits
  • Optional equity donation matching
  • Generous vacation and parental leave
  • Flexible working hours
  • Office space for collaboration
  • Visa sponsorship available

Interested in this job?

Jobs Related To Anthropic Machine Learning Systems Engineer, Encodings and Tokenization

Research Engineer, Pre-training

Research Engineer position at Anthropic focusing on pre-training large language models, combining cutting-edge AI research with practical engineering implementation.

Machine Learning Systems Engineer, Model APIs

Machine Learning Systems Engineer role at Anthropic focused on building and maintaining Model Evaluations infrastructure and Research Inference APIs.

Research Engineer, Pre-training

Research Engineer position at Anthropic focusing on pre-training large language models, combining cutting-edge AI research with practical engineering to develop safe and trustworthy AI systems.

Machine Learning Engineer, Generative AI Innovation Center

Senior ML Engineering role at AWS's Generative AI Innovation Center, focusing on developing and deploying advanced ML models and generative AI solutions for enterprise customers.

Machine Learning Engineer, Generative AI Innovation Center

Senior ML Engineering role at AWS's Generative AI Innovation Center, focusing on developing advanced ML models and Gen AI solutions for enterprise customers.