Would love to find out they're overfitting for pelican drawings.

fdeage · 2026-02-05T23:28:49 1770334129

OpenAI claims not to: https://x.com/aidan_mclau/status/1986255202132042164

mattacular · 2026-02-06T13:01:28 1770382888

That settles it

fragmede · 2026-02-05T19:54:41 1770321281

The estimation I did 4 months ago:

> there are approximately 200k common nouns in English, and then we square that, we get 40 billion combinations. At one second per, that's ~1200 years, but then if we parallelize it on a supercomputer that can do 100,000 per second that would only take 3 days. Given that ChatGPT was trained on all of the Internet and every book written, I'm not sure that still seems infeasible.

https://news.ycombinator.com/item?id=45455786

eli · 2026-02-05T20:07:21 1770322041

How would you generate a picture of Noun + Noun in the first place in order to train the LLM with what it would look like? What's happening during that 1 estimated second?

fragmede · 2026-02-06T12:59:02 1770382742

Use any of the image generation models (eg Nanobanana, Midjourney, or ChatGPT) to generate a picture of a noun on a noun. Simonw's test is to have a Language (text) model generate a Scalar Vector Graphic, which the language model has to do by writing curves and colors, like draw a spline from point 150,100 to 200,300 of type cubic, using width 20, color orange.

In that hypothetical second is freaking fascinating. It's a denoising algorithm, and then a bunch of linear algebra, and out pops a picture of a pelican on a bicycle. Stable diffusion does this quite handily. https://stablediffusionweb.com/image/6520628-pelican-bicycle...

metalliqaz · 2026-02-05T21:51:52 1770328312

its pelicans all the way down

Terretta · 2026-02-05T20:39:20 1770323960

This is why everyone trains their LLM on another LLM. It's all about the pelicans.

AnimalMuppet · 2026-02-05T21:28:47 1770326927

But you need to also include the number of prepositions. "A pelican on a bicycle" is not at all the same as "a pelican inside a bicycle".

There are estimated to be 100 or so prepositions in English. That gets you to 4 trillion combinations.

jodrellblank · 2026-02-06T15:41:15 1770392475

The prompt was "a pelican riding a bicycle"; not prepositions but every verb. Potentially every adverb+verb combination - "a pelican clumsily pushing a bicycle"

theanonymousone · 2026-02-05T21:44:45 1770327885

Even if not intentionally, it is probably leaking into training sets.

andy_ppp · 2026-02-05T19:16:19 1770318979

Yes, Racoon on a unicycle? Magpie on a pedalo?

throw310822 · 2026-02-05T21:27:51 1770326871

Correct horse battery staple:

https://claude.ai/public/artifacts/14a23d7f-8a10-4cde-89fe-0...

Schlagbohrer · 2026-02-06T15:58:24 1770393504

That is the nastiest, ugliest horse ever

HappMacDonald · 2026-02-07T23:46:29 1770507989

wait, how do you know my pw?

ta988 · 2026-02-05T21:49:58 1770328198

no staple?

iwontberude · 2026-02-05T22:24:54 1770330294

it looks like a bodge wire

_kb · 2026-02-05T22:38:37 1770331117

Platypus on a penny farthing.