Funny how AI now seems to go through developmental phases similar to those we see in young children, if in a weird, convoluted way. Strawberry spelling and car wash aren't particularly intuitive as cognitive developmental stages.
E.g. the well-known mirror test [1], passed by kids from around age 1.5-2.
Or object permanence [2]: by age 2, children know that things out of sight haven't disappeared from existence.
Also, strawberry spelling isn't a real test for current LLMs, because they have no concept of letters: they work on tokens, each of which may span several characters, including punctuation and numerals. To have any hope of answering that question correctly, either the tokens would need the granularity of individual letters, which would massively balloon model size and training time, or the LLM would need to call out to an external tool that returns the result (and there would need to be sufficient examples in the training data to prime that trigger to fire).
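To make the granularity point concrete, here's a toy sketch (not a real LLM tokenizer; the vocabulary and token IDs are made up) of why counting letters is hard at the token level:

```python
# Hypothetical subword vocabulary; real tokenizers (BPE etc.) work similarly
# in spirit, but these splits and IDs are purely illustrative.
toy_vocab = {"str": 1012, "aw": 287, "berry": 19772}

word = "strawberry"
tokens = ["str", "aw", "berry"]  # one plausible subword split
token_ids = [toy_vocab[t] for t in tokens]

# The model only ever sees the integer IDs -- the letters are gone
# by the time the input reaches it.
print(token_ids)

# Counting the 'r's requires access to the characters, not the IDs:
r_count = sum(t.count("r") for t in tokens)
print(r_count)  # 3
```

The point of the sketch: once text is mapped to IDs, "how many r's are in 19772?" is not a question the model's input representation can answer directly; it can only succeed by memorized spellings or an external tool.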
While that's true, the tokenizer is only half the problem. The more important fault demonstrated is that the model doesn't _know_ it can't see the letters, and won't say so unless it has been trained or instructed to. A sentence like "I can't see letters through the tokenizer" never appears in a corpus of human writing.
[1] https://en.wikipedia.org/wiki/Mirror_test
[2] https://en.wikipedia.org/wiki/Object_permanence