A middle way could be to only observe robots.txt for crawling, and not for displ...

		C4K3 on April 25, 2018 \| parent \| context \| favorite \| on: Addressing Recent Claims of “Manipulated” Blog Pos... A middle way could be to only observe robots.txt for crawling, and not for displaying pages. So once a page is grabbed, it's available forever. But if a page is covered by a robots.txt exclusion, it won't be crawled.