github.com/wbrown/gpt_bpe@v0.0.0-20250709161131-1571a6e8ad2d/resources/data/clip-tokenizer/specials.txt (about) 1 <|startoftext|> 2 <|endoftext|>