Even cool projects can learn from others. Maybe they missed something that could benefit the project, or made some interesting technical choice that gives a different result.
For the readers/learners, it's useful to understand the differences so we know what details matter, and which are just stylistic choices.
But it isn't the OP's responsibility to compare their project to all other projects. The GP could perform the comparison themselves and post their thoughts instead of asking an open-ended question.
It isn't, but such information will be immensely helpful to anyone who wants to learn from such projects. Some tutorials are objectively better than others, and learners can benefit from such information.
Well, the person who asked the question, for one. I'm sure they're not the only one. Best not to assume why people are asking though, so you can save time by not writing irrelevant comments.
The Matrix style human pods: we live in blissful ignorance in the Matrix, while the LLMs extract more and more compute power from us so some CEO somewhere can claim they have now replaced all humans with machines in their business.
I was thinking more of the season 3 episode of Doctor Who titled Gridlock where everyone lives in flying cars circling a giant expressway underground, while all the upper class people on the surface died years ago from a pandemic.
Does autoresearch work for projects that are not LLM-based? E.g. in Karpathy's example he is optimizing nanoGPT. What if I wanted to improve a Unet for image segmentation?
Tobi from Shopify used a variant of autoresearch to optimize the Liquid template engine, and found a 53% speedup after ~120 experiments: https://github.com/Shopify/liquid/pull/2056
How much did this cost? Has there ever been an engineering focus on performance for liquid?
It’s certainly cool, but the optimizations are so basic that I’d expect a performance engineer to find these within a day or two with some flame graphs and profiling.
He used Pi as the harness but didn't say which underlying model. My stab-in-the-dark guess would be no more than a few hundred dollars in token spend (for 120 experiments run over a few days, assuming Claude Opus 4.6 was used without the benefits of the Claude Max plan).
So cheaper than a performance engineer for a day or two... but the Shopify CEO's own time is likely a whole lot more expensive than a regular engineer!
The gist of these things is you point them at an eval metric and say 'make it go better.' So you can point it at anything you can measure. The example in the blog post here is bounding boxes on wood cut images.
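A minimal sketch of that loop, with hypothetical stand-ins for the metric and the proposal step (this is not the actual autoresearch code, just the shape of it: propose an experiment, score it, keep the best):

```python
import random

def eval_metric(params):
    # Hypothetical stand-in for whatever you can measure:
    # IoU for a Unet, tokens/sec for a template engine, etc.
    # Here: a toy score maximized at x = 3.
    return -(params["x"] - 3.0) ** 2

def propose(params):
    # Stand-in for the model proposing a tweaked experiment.
    return {"x": params["x"] + random.uniform(-1.0, 1.0)}

best = {"x": 0.0}
best_score = eval_metric(best)
for _ in range(120):  # ~120 experiments, as in the Liquid example
    candidate = propose(best)
    score = eval_metric(candidate)
    if score > best_score:  # keep only improvements
        best, best_score = candidate, score
```

Swapping domains really is just swapping `eval_metric` out; the loop itself doesn't care what it's optimizing.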
Yes, that's the real strength of it. The structure is dead simple, so you just have to switch the goal metric.
I used it on a data science project to find the best rules for achieving a defined outcome. At first, for fun, then I actually used some of its insights (and it caught a sampling issue I overlooked, oops)
I used it to speed up a codecompass-like repo from 86 files per second to 2000. I still haven't used the repo in production, so maybe it secretly broke things, but the ability to say "optimize this benchmark and commit only if you pass these tests" is nice.
The CBC is reporting the analysis of The World Happiness Report - it's not coming to its own conclusions. Maybe you should read the article and original source yourself before making hasty comments.
This is not true. The hack did not affect Stryker products sold to hospitals and clinics; it only impacted Stryker employees' work and personal devices. Yes, 50 TB of data was exfiltrated, and it remains to be seen what that data is and how it might impact products down the line.
Medical equipment reps often play a pretty active role in patient care. Can't get in touch with a rep to put a device into its MRI-safe mode? No MRI for you. Can't get a rep in to help the surgeon with the type of hardware they were going to install? No surgery for you.
People's AICDs aren't going to start exploding, but I'm pretty confident this will hamper care for many patients.