Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

About 12 TB uncompressed json until the middle of 2022, with a dataset that grows 250GB+ per month. If you throw away all metadata you are left with between half and a quarter of that in high quality text.


> high quality

That's a hot take




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: