
What do you do about things you don't want backed up? Say, an old portfolio or social media profiles tied to your name? If anyone can pull up any site in the Wayback Machine, doesn't that make it hard to sanitize your online footprint?



If you control the site, you can use robots.txt to exclude it, though I'd think carefully about whether you really want to do that.

If someone else owns the site, or if it's a social media site, you'd have to see what the site owner will do. There's probably not much you can do on your own to prevent the site from being archived.


archive.org respects robots.txt, so I guess you can exclude what you want. I'm not sure whether that's a great thing, though.

Preferably, people would think more carefully before disallowing /. I have been disappointed many times to find that information is entirely gone because archive.org respects robots.txt and the site is now offline forever.
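
To make the "don't disallow /" point concrete, here is a rough sketch of a narrower robots.txt, checked with Python's standard urllib.robotparser. It assumes the archive crawler identifies itself as ia_archiver (the name historically associated with archive.org exclusions; verify it before relying on it), and the example.com URLs and /old-portfolio/ path are made up for illustration. It only shows how the rules parse, not what any crawler actually honors.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: block one directory for one crawler only,
# instead of "Disallow: /" for everyone.
# "ia_archiver" is an assumed user-agent name; confirm it yourself.
ROBOTS_TXT = """\
User-agent: ia_archiver
Disallow: /old-portfolio/

User-agent: *
Disallow:
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt lets user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    checks = [
        ("ia_archiver", "https://example.com/old-portfolio/index.html"),
        ("ia_archiver", "https://example.com/blog/"),
        ("SomeOtherBot", "https://example.com/old-portfolio/index.html"),
    ]
    for agent, url in checks:
        print(agent, url, is_allowed(ROBOTS_TXT, agent, url))
```

With rules like these, only the one directory is excluded for the one crawler, and the rest of the site stays archivable.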


I suppose it depends on whether you place greater value on historical accuracy or personal image. No doubt lots of people have published silly or embarrassing things years ago, but those things are still real.



