
It would be great to see more focus on Chinchilla's result that most large models were significantly undertrained relative to what would minimize test loss for their compute budget.


Agreed, we did not discuss that sufficiently.



