Hacker Newsnew | past | comments | ask | show | jobs | submit | scotty79's commentslogin

Previous iterations of ARC-AGI were reminiscent of IQ tests. This one is just too easy and the fact that models do terribly bad on it probably means that there is input mode mismatch or operation mode mismatch.

If model creators are willing to teach their llms to play computer games through text it's gonna be solved in one minor bump of the model version. But honestly, I don't think they are gonna bother because it's just too stilly and they won't expect their models are going to learn anything useful from that.

Especially since there are already models that can learn how to play 8-bit games.

It feels like ARC-AGI jumped the shark. But who knows, maybe people who train models for robots are going to take it in stride.


General intelligence not owning retinas.

Denying proper eyesight harness is like trying to construct speech-to-text model that makes transcripts from air pressure values measured 16k times per second, while human ear does frequency-power measurement and frequency binning due to it's physical construction.


> Netanyahu's follow-up coffee shop video is real too

Really? The coffee in his cup, filled to the brim, did the most bizarre dance possible. And he handled the cup as if was empty, without any care.


Today it's a joke, but in a year or two it's gonna be genuine strategy to avoid paying yourself for all the inference your open source project needs. Tokens are gonna be worth a lot. Event today there are already programmers who are burning more money for tokens than their salary is and it's still worth for their employers. Open source projects with shoestring budgets won't be able to afford that.

"There's infinite amount of money in the federal reserve."

I think models from one year ago with proper harness should be easily beating humans at this task on average. Human CEOs decisions are worse than random chance.

I feel like WinForms was peak (because it was close to Delphi) and it went downhill from there.

IMO WinForms didn't manage to reach even Delphi 2's state, so i'd say it is far from peak considering the improvements later Delphis (and now Lazarus) added. It was abandoned too soon.

I meant peak for MS.

> It was abandoned too soon.

This is probably a recurring theme for MS UI frameworks.


Does the thread of you your consciousness end when you go to sleep?

Does the thread of someone elses consciousness ends when they experience grand mal seizure and thrir electrical brain activity goes wrong all at once and then resets?

How's "waking up" in the virtual different from waking up from grand mal seizure? (assuming that all relevant biochemical data of neurons was read correcly and their behavior is simulated correctly)


No because you know it was you who was in deep sleep And just woke up

What does "you" even mean?

> Online anonymity has significant, real-world drawbacks.

Online anonymity has significant, real-world benefits which every doxxed person ever will list for you.


And drawbacks, too. Imagine if you could only dox someone else by doxxing yourself at the same time.

I don’t think that is really a sufficient defense? The amount of focus pointed at the person matters for this.

I wouldn't even call them two groups. It's just one group ostensibly and publicly split in half, but it's still one group that intermingles behind the courtains.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: