Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

That's why I always record the number of rolls it takes to get to an acceptable result on my GenAI Comparison site for each model - it's a broad metric indicating how much you have to fight to steer the model in the right direction.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: