
> we are inspired by the recent advancements in reinforcement learning (e.g., o1)

It will be interesting to see what the future brings as models incorporate chain-of-thought approaches, and whether o1 will be outperformed by open-source models.


