github.com/wbrown/gpt_bpe@v0.0.0-20250709161131-1571a6e8ad2d/resources/data/llama-tokenizer (about)

duplicates.json
encoder.json
merges.json
special_tokens_map.json
specials.txt
tokenizer.model
tokenizer_config.json