Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

For longer documents it uses vector embeddings


How's that different from pasting the text in the first chat and running the vector embedding step on the text on the server (maybe at least bypassing the chat text limit)? Does this fix the amnesia issue where the info from chats longer than the context length is forgotton because the document isn't baked directly into the weights like fine tuning?




Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: