github.com/wbrown/gpt_bpe@v0.0.0-20250709161131-1571a6e8ad2d/resources/data/gpt2-tokenizer/specials.txt (about) 1 <|endoftext|> 2 <|padding|>