> As a result, the model isn’t trained on understanding the useRalativeImagePath token, and so it outputs something that isn’t a valid token.
That isn't how LLMs generate tokens. Each step outputs a logit for every possible token in the tokenizer's vocabulary (~100k for GPT-3.5), applies a softmax to convert the logits into probabilities, then samples from that distribution (scaled by temperature) to pick the next token.
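For the curious, the whole sampling step fits in a few lines. A minimal numpy sketch with a toy 5-token vocabulary (an illustration of the mechanism, not OpenAI's actual implementation):

```python
import numpy as np

def sample_next_token(logits: np.ndarray, temperature: float = 1.0) -> int:
    """Pick the next token id from a vocabulary-sized logit vector."""
    # Scale logits by temperature (lower = more deterministic).
    scaled = logits / max(temperature, 1e-8)
    # Softmax: subtract the max for numerical stability, exponentiate, normalize.
    scaled -= scaled.max()
    probs = np.exp(scaled)
    probs /= probs.sum()
    # Sample a token id from the resulting distribution.
    return int(np.random.choice(len(probs), p=probs))

# Toy example: a "vocabulary" of 5 tokens.
logits = np.array([2.0, 1.0, 0.5, -1.0, -3.0])
print(sample_next_token(logits, temperature=0.7))
```

Whatever id comes out, it is by construction a valid entry in the vocabulary, which is the point above.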
It's possible something in the tokenizer's BPE merge process breaks on the rare token, which can be verified offline using tiktoken. But since GPT-3.5 and GPT-4 use the same tokenizer, if GPT-4 works then that's likely not the issue.
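The offline check is straightforward. A minimal sketch with tiktoken, assuming the cl100k_base encoding that both GPT-3.5 and GPT-4 use:

```python
import tiktoken  # pip install tiktoken

# GPT-3.5 and GPT-4 share the cl100k_base encoding.
enc = tiktoken.get_encoding("cl100k_base")

# Round-trip the rare token to check the BPE merges behave.
text = "useRalativeImagePath"
ids = enc.encode(text)
print(ids)                       # the BPE token ids for the string
assert enc.decode(ids) == text   # decode should reproduce the input exactly
```

If the round-trip succeeds, the tokenizer itself handles the string fine and the problem lies elsewhere in the stack.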
Exactly this. The tokens generated should always be valid, unless some post-processing layer between the model's output and the user interface checks for certain keywords it would prefer to filter out. In which case, I suppose a different, commonly seen error message would appear?
Not really, right? There are a ton of special tokens, like start-of-sequence etc., so what happens if two start-of-sequence tokens are predicted? Each is a valid token, but the pair can't really be turned into anything sensible, so it throws an error when converting tokens back to plain text?
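tiktoken itself illustrates that text/token boundary. Decoding special-token ids works, but the library deliberately errors when a special token shows up where plain text is expected (a sketch of the library's behavior, not a claim about what OpenAI's serving stack does):

```python
import tiktoken

enc = tiktoken.get_encoding("cl100k_base")

# Decoding special-token ids works fine, even two in a row...
eot = enc.eot_token                # id of "<|endoftext|>"
print(enc.decode([eot, eot]))      # "<|endoftext|><|endoftext|>"

# ...but encoding text that contains one raises by default,
# precisely to keep special tokens out of ordinary text.
try:
    enc.encode("<|endoftext|>")
except ValueError as e:
    print("refused:", e)
```

So duplicated special tokens decode without error at the tokenizer level; any failure would come from logic further up the stack that interprets them.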