Hacker News new | past | comments | ask | show | jobs | submit login

IIRC archive.org pay attention to the robots.txt now rather than in 2024.



They do not. They have in the past removed public access to pages due to robots.txt changes.


Hmm, this old blog post sounds like they were planning on ending that practice:

https://blog.archive.org/2017/04/17/robots-txt-meant-for-sea...




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: