Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
Squarex
23 days ago
|
parent
|
context
|
favorite
| on:
Claude Opus 4.6
it's not a great benchmark anymore... starting with it being python / django primarily... the industry should move to something more representative
usaar333
23 days ago
[–]
Openai has; they don't even mention score on gpt-5.3-codex.
On the other hand, it is their own verified benchmark, which is telling.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: