
The trained model is just a bunch of statistics. To use those statistics to generate text you need to "sample" from the model. If you always sampled by taking the model's #1 token prediction (greedy decoding), the output would be deterministic. More commonly, though, the next token is drawn at random from the top-K most likely tokens, or from the smallest set of tokens whose probabilities add up to at least p (top-p), which is where the randomness comes in.
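To make the difference concrete, here's a rough sketch in plain Python. The logits are made-up numbers standing in for a model's scores over a 4-token vocabulary, and the function names (`greedy`, `top_k_sample`, `top_p_sample`) are just illustrative, not any particular library's API.

```python
import math
import random

def softmax(logits):
    # Turn raw scores into probabilities (subtract the max for numerical stability).
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def greedy(probs):
    # Always take the #1 prediction: same input, same output.
    return max(range(len(probs)), key=probs.__getitem__)

def top_k_sample(probs, k, rng):
    # Keep the k most likely tokens, then draw one in proportion to its probability.
    top = sorted(range(len(probs)), key=probs.__getitem__, reverse=True)[:k]
    return rng.choices(top, weights=[probs[i] for i in top], k=1)[0]

def top_p_sample(probs, p, rng):
    # Keep the smallest set of top tokens whose total probability reaches p,
    # then draw one in proportion to its probability.
    ranked = sorted(range(len(probs)), key=probs.__getitem__, reverse=True)
    nucleus, mass = [], 0.0
    for i in ranked:
        nucleus.append(i)
        mass += probs[i]
        if mass >= p:
            break
    return rng.choices(nucleus, weights=[probs[i] for i in nucleus], k=1)[0]

# Hypothetical scores for a 4-token vocabulary (assumed values, not real model output).
logits = [2.0, 1.0, 0.5, -1.0]
probs = softmax(logits)
rng = random.Random(0)  # seeded only so repeated runs of this sketch match

print("greedy:", greedy(probs))
print("top-k (k=2):", [top_k_sample(probs, 2, rng) for _ in range(10)])
print("top-p (p=0.8):", [top_p_sample(probs, 0.8, rng) for _ in range(10)])
```

The greedy line prints the same token every time, while the top-k and top-p lines print a mix of the likeliest tokens. Real inference stacks typically also apply a temperature to the logits before this step, which is left out here.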

