See Byte Pair Encoding happen step by step. Type any text and watch character pairs merge, building the vocabulary in real time.
At each step, adjacent token pairs are counted and the most frequent pair is merged. Each merge adds a new token to the vocabulary.
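The merge loop described above can be sketched in a few lines of Python. This is a minimal illustration, not the code behind this page: the function name `bpe_merges`, the choice to start from UTF-8 bytes, and the first new token ID of 256 are all assumptions made for the example. Ties between equally frequent pairs go to the pair seen first.

```python
from collections import Counter


def bpe_merges(text, num_merges, first_new_id=256):
    """Run up to `num_merges` BPE merges on `text`.

    Assumption: the initial tokens are UTF-8 byte values (0-255), so new
    token IDs start at 256. A character-level demo would differ here.
    Returns the final token list and a log of (step, pair, new_id, count).
    """
    tokens = list(text.encode("utf-8"))
    merges = []
    next_id = first_new_id

    for step in range(1, num_merges + 1):
        # Count every adjacent pair (overlapping occurrences included).
        pair_counts = Counter(zip(tokens, tokens[1:]))
        if not pair_counts:
            break

        # Pick the most frequent pair; stop when nothing repeats.
        pair, count = pair_counts.most_common(1)[0]
        if count < 2:
            break

        # Replace each non-overlapping occurrence, scanning left to right.
        merged = []
        i = 0
        while i < len(tokens):
            if i + 1 < len(tokens) and (tokens[i], tokens[i + 1]) == pair:
                merged.append(next_id)
                i += 2
            else:
                merged.append(tokens[i])
                i += 1

        tokens = merged
        merges.append((step, pair, next_id, count))
        next_id += 1

    return tokens, merges


if __name__ == "__main__":
    final_tokens, log = bpe_merges("aaabdaaabac", num_merges=10)
    # Print the log in the same column order as the merge table.
    for step, pair, new_id, count in log:
        print(f"| {step} | {pair} | {new_id} | {count} |")
```

Each logged row has the same columns as the merge table: step number, merged pair, new token ID, and the pair's count at the time of the merge. For the input `aaabdaaabac`, the first merge under these assumptions is the pair `(97, 97)` (two `a` bytes) with a count of 4, which becomes token 256.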
| # | Pair | New Token ID | Count |
|---|---|---|---|