Nostalgebraist talked about the extreme weirdness of BPEs and how they adjust chaotically centered on whitespace, capitalization, and context for GPT-2, with a followup publish for GPT-3 on the even weirder encoding of quantities sans commas.
Feel free to visit my blog post ::
Going Listed here