Skip to main content

Keep the news in the Wayback Machine. Sign Fight for the Future's letter.

View Post [edit]

Poster: Archive Lover1 Date: Oct 23, 2015 8:44am
Forum: web Subject: Wayback machine rebuild suggestions

Hey,
I just read on the verge you guys had secured funding for a complete wayback machine rebuild - congratulations!

I have a couple of suggestions:

Allow the ability to use the legacy wayback machine during introduction, such as you did with the site upgrade.

The robots.txt implementation, although understandable, I think could be improved. Domains can be hijacked and robots.txt changed - making the entire stored wayback sites inaccessible. Perhaps allow access to all historical captures, and only stop crawling when robots.txt is changed.

The search function sounds scary. Be careful with this!
Searching for people's names will bring up a lot of privacy issues, and more importantly would show their cringey sites from 2006 :)

Good luck!

Reply [edit]

Poster: h891322 Date: Dec 12, 2015 5:55am
Forum: web Subject: Re: Wayback machine rebuild suggestions

I absolutely agree about robots.txt, its current state should not affect any previously archived pages.
I wish the Wayback Machine could completely ignore robots.txt rules when saving pages manually using http://archive.org/web/.

Also, it seems that any feedback regarding Wayback Machine issues sent to info@archive.org (I don't know any other contact means) is not being read, I never got a reply.