ma2rten on Nov 28, 2013 | on: 102TB of New Crawl Data Available

It would be great if Common Crawl (or anyone else) would also release a document-term index for its data. With an index you could do a lot more with this data, since you could look up which documents contain a given term without scanning the whole crawl.
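
For anyone unfamiliar with the term, here is a rough sketch of a document-term index built as a toy inverted index in Python. The three documents, the whitespace tokenizer, and the `build_index` and `search` helpers are all made up for illustration; this says nothing about how Common Crawl stores its data or how a real index over it would be built.

```python
from collections import defaultdict


def build_index(docs):
    """Map each term to the sorted list of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        # Assumption: naive lowercase + whitespace tokenization.
        for term in text.lower().split():
            index[term].add(doc_id)
    return {term: sorted(ids) for term, ids in index.items()}


def search(index, query):
    """Return the ids of documents that contain every term in the query."""
    terms = query.lower().split()
    if not terms:
        return []
    result = set(index.get(terms[0], []))
    for term in terms[1:]:
        result &= set(index.get(term, []))
    return sorted(result)


# Hypothetical toy corpus standing in for crawled pages.
docs = {
    "doc1": "common crawl releases new crawl data",
    "doc2": "an index makes crawl data searchable",
    "doc3": "unrelated page about cooking",
}

index = build_index(docs)
print(search(index, "crawl data"))  # ['doc1', 'doc2']
```

The point is that a lookup only touches the entries for the query terms, so you never have to read the documents themselves to find the matching ones.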