Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Kimi is noticeably better at tool calling than gpt-oss-120b.

I made a fun toy agent where the two models are shoulder surfing each other and swap the turns (either voluntarily, during a summarization phase), or forcefully if a tool calling mistake is made, and Kimi ends up running the show much much more often than gpt-oss.

And yes - it is very much fun to build those!



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: