Apparently the score would be a little higher if scores weren't capped at the human baseline: a model is penalized for doing worse than humans, but gets no extra credit for doing better. That seems like an arbitrary decision, since the human baseline is not optimal.
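One way to picture the asymmetry (a minimal sketch; the normalization and cap are my assumptions, not ARC-AGI-3's actual published formula):

```python
def capped_score(ai_score: float, human_baseline: float) -> float:
    """Hypothetical scoring rule: normalize against the human baseline
    and cap at 1.0, so falling short is penalized proportionally but
    exceeding the baseline earns nothing extra."""
    return min(ai_score / human_baseline, 1.0)

# Falling short of the baseline lowers the score...
print(capped_score(60, 80))   # 0.75
# ...but exceeding it is clipped to 1.0 rather than rewarded.
print(capped_score(100, 80))  # 1.0
```

Under a rule like this, all superhuman performance collapses to the same ceiling, which is exactly the complaint above.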
Once you have matched humans on a problem, further progress on it is no longer necessarily meaningful as a quantitative measure of intelligence. ARC-AGI-3 is designed to compare AIs to humans, not to measure arbitrarily high levels of superhuman intelligence; for that you would want a different benchmark.