Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Anna's archive has some 300M pdfs.


We're talking about the open web here. But yeah that's the point, the dataset is unreasonably small.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: