Machine Learning Systems Engineer, Encodings and Tokenization

Anthropic

Anthropic creates reliable, interpretable, and steerable AI systems, focusing on safe and beneficial AI development for users and society.

San Francisco, CA, USA • New York, NY, USA

$300,000 - $405,000

Machine Learning

Staff Software Engineer

Hybrid

501 - 1,000 Employees

8+ years of experience

Description For Machine Learning Systems Engineer, Encodings and Tokenization

Anthropic is seeking an experienced Machine Learning Systems Engineer to join their Encodings and Tokenization team. This role represents a unique opportunity to work at the intersection of machine learning infrastructure and research, focusing on developing critical systems that power AI model training.

The position involves building and optimizing tokenization systems that are fundamental to Anthropic's AI research progress. As a bridge between Pretraining and Finetuning teams, you'll be responsible for creating infrastructure that enables more efficient and effective training of AI systems while maintaining their reliability and interpretability.

The ideal candidate brings 8+ years of software engineering experience with strong machine learning expertise. You'll work in a collaborative environment that values pair programming and emphasizes the societal impact of AI development. The role requires proficiency in Python, experience with ML infrastructure, and the ability to work effectively in a research-driven environment.

Anthropic offers a competitive compensation package ranging from $300,000 to $405,000 USD annually, along with comprehensive benefits including equity donation matching, generous leave policies, and flexible working arrangements. The position follows a hybrid work model requiring at least 25% in-office presence in either San Francisco or New York City.

The company stands out for its approach to AI research, treating it as an empirical science similar to physics and biology. They focus on large-scale research efforts aimed at developing steerable, trustworthy AI systems. Their work builds on significant research achievements including GPT-3, Circuit-Based Interpretability, and Learning from Human Preferences.

This role offers the opportunity to work on cutting-edge AI technology while contributing to Anthropic's mission of ensuring AI systems are safe and beneficial for society. The position combines technical challenges with meaningful impact, making it ideal for engineers who care about both technical excellence and ethical AI development.

Last updated 2 days ago

Responsibilities For Machine Learning Systems Engineer, Encodings and Tokenization

Design, develop, and maintain tokenization systems used across Pretraining and Finetuning workflows
Optimize encoding techniques to improve model training efficiency and performance
Collaborate closely with research teams to understand their evolving needs around data representation
Build infrastructure that enables researchers to experiment with novel tokenization approaches
Implement systems for monitoring and debugging tokenization-related issues
Create robust testing frameworks to validate tokenization systems
Identify and address bottlenecks in data processing pipelines
Document systems thoroughly and communicate technical decisions

Requirements For Machine Learning Systems Engineer, Encodings and Tokenization

Python

8+ years of software engineering experience
Significant software engineering experience with demonstrated machine learning expertise
Comfortable navigating ambiguity in rapidly evolving research environments
Strong collaboration skills with cross-functional teams
Proficient in Python and familiar with modern ML development practices
Strong analytical skills
Experience with machine learning systems, data pipelines, or ML infrastructure

Benefits For Machine Learning Systems Engineer, Encodings and Tokenization

Visa Sponsorship

Competitive compensation and benefits
Optional equity donation matching
Generous vacation and parental leave
Flexible working hours
Office space for collaboration
Visa sponsorship available

Anthropic

Anthropic creates reliable, interpretable, and steerable AI systems, focusing on safe and beneficial AI development for users and society.

San Francisco, CA, USA • New York, NY, USA

$300,000 - $405,000

Machine Learning

Staff Software Engineer

Hybrid

501 - 1,000 Employees

8+ years of experience

Interested in this job?

Jobs Related To Anthropic Machine Learning Systems Engineer, Encodings and Tokenization

Research Engineer, Pre-training

Anthropic

Research Engineer position at Anthropic focusing on pre-training large language models, combining cutting-edge AI research with practical engineering implementation.

Machine Learning Systems Engineer, Model APIs

Anthropic

Machine Learning Systems Engineer role at Anthropic focused on building and maintaining Model Evaluations infrastructure and Research Inference APIs.

Research Engineer, Pre-training

Anthropic

Research Engineer position at Anthropic focusing on pre-training large language models, combining cutting-edge AI research with practical engineering to develop safe and trustworthy AI systems.

Machine Learning Engineer, Generative AI Innovation Center

Amazon

Senior ML Engineering role at AWS's Generative AI Innovation Center, focusing on developing and deploying advanced ML models and generative AI solutions for enterprise customers.

Machine Learning Engineer, Generative AI Innovation Center

Amazon

Senior ML Engineering role at AWS's Generative AI Innovation Center, focusing on developing advanced ML models and Gen AI solutions for enterprise customers.