I wish web.archive.org had an index by someone like common crawl. There is lots ...

wumpus · on Jan 12, 2022

web.archive.org has a CDX index, similar to Common Crawl.

Since I use both of these archives together, I wrote this code to iron out the differences between them:

kevinsundar · on Jan 12, 2022

Hey! I was using your tool a couple months ago. It was super helpful for my project.

wumpus · on Jan 13, 2022

Thanks! I rarely hear from users, great to hear from you!

kevinsundar · on Jan 12, 2022

They do and its better than common crawl's by my testing.