On my 24GB RAM M4 Pro MBP some models run very quickly through LM Studio to Zed,...

jbellis · 2026-03-04T03:42:48 1772595768

Your limitation after prefill is memory bandwidth. A maxed out Studio has less than a single 3090 (really).

sroussey · 2026-03-04T07:07:03 1772608023

Yeah, the 3090 has faster memory, but not by a lot.

The 5090 is at 1,792GB/sec and potential M5 Ultra would be 1,230GB/sec and 512GB RAM. Maybe 1TB. Not 32.

thejazzman · 2026-03-05T13:04:05 1772715845

You’re suggesting that a difference of the entirety of the M5 Max’s bandwidth is an insignificant gap!

veidr · 2026-03-05T14:55:32 1772722532

No, that difference is the 5090, not the 3090.

efxhoy · 2026-03-03T20:06:47 1772568407

How is the output quality of the smaller models?

elsombrero · 2026-03-04T00:00:23 1772582423

not good enough for coding anything more than simple scripts.

generally, the less parameters, the less knowledge they have.

kraig911 · 2026-03-04T02:09:25 1772590165

what model were you using?

giancarlostoro · 2026-03-04T21:14:33 1772658873

Wrote about it here: