Changes between Initial Version and Version 1 of ArchivingSites


Timestamp:
03/18/14 10:57:39 (3 years ago)
Author:
chris
Comment:

Page created trying to draw some lessons from the problems archiving the in transition movie site

  • ArchivingSites

    v1 v1  
     1Some things to think about when archiving an old site: 
     2 
     3A lot of time and energy goes into creating content, so please think carefully before deciding to throw content away. What appears to have no value to you might well have value to someone else; archives do have value. This is not a new idea: please read [http://www.nngroup.com/articles/web-pages-must-live-forever/ Web Pages Must Live Forever] and [http://www.w3.org/Provider/Style/URI Cool URIs don't change]. 
     4 
     5 
     6Archiving a site well takes time; if it is rushed, things that should be considered might be missed. 
     7 
     8 
     9Static HTML is the best form for archives as it doesn't require maintenance. However, if mass edits need to be made, they need to be done to the dynamic site before it is archived. 
     10 
     11[http://httrack.com/ HTTrack] is a great tool for archiving sites. It is packaged in Debian and can be run on the command line, in a screen session, on the server where the archive is to live. 
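
As a rough illustration of the command-line use mentioned above, the sketch below builds a basic HTTrack invocation. It only relies on HTTrack's `-O` option for the output directory; the helper name `build_httrack_command`, the URL and the path are hypothetical and not from the original page.

```python
import shlex


def build_httrack_command(site_url, output_dir):
    """Return an argument list that mirrors site_url into output_dir."""
    # -O tells HTTrack which directory to write the mirror into.
    return ["httrack", site_url, "-O", output_dir]


if __name__ == "__main__":
    # Hypothetical URL and path, for illustration only.
    cmd = build_httrack_command("http://example.org/", "/var/www/archive/example.org")
    # Print a shell-quoted command that can be pasted into a screen session.
    print(shlex.join(cmd))
```

Printing the command rather than running it keeps the sketch side-effect free; the printed line can then be run by hand inside screen on the archive server.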
     12 
     13 
     14Forms, e.g. contact forms and search forms, won't work on the static archive; they are best removed before the archiving is done. 
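
One way to check that no forms slipped through is to scan the static copy afterwards. The following is a minimal sketch, assuming the archive is a directory of `.html` files; `count_forms` and `pages_with_forms` are hypothetical helper names, and the check only reports leftovers, it does not remove them.

```python
from html.parser import HTMLParser
from pathlib import Path


class FormCounter(HTMLParser):
    """Counts opening <form> tags in the HTML fed to it."""

    def __init__(self):
        super().__init__()
        self.forms = 0

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag names before calling this method.
        if tag == "form":
            self.forms += 1


def count_forms(html_text):
    """Return the number of <form> elements opened in html_text."""
    parser = FormCounter()
    parser.feed(html_text)
    parser.close()
    return parser.forms


def pages_with_forms(archive_dir):
    """Map each .html file under archive_dir that still has forms to its count."""
    hits = {}
    for path in sorted(Path(archive_dir).rglob("*.html")):
        n = count_forms(path.read_text(encoding="utf-8", errors="replace"))
        if n:
            hits[str(path)] = n
    return hits
```

An empty result from `pages_with_forms` suggests the forms were removed from the dynamic site before the mirror was taken.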
     15 
     16 
     17Error pages: some URLs will change when the site is archived, so it is best to set up custom error pages to catch these. 
     18 
     19 
     20Web bugs: it is best to remove any GA (Google Analytics) or other such bugs before archiving. In addition, if the archive is to have Piwik stats, it is best to add the Piwik code before creating the archive.
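
A quick check along these lines can confirm which trackers a page carries before or after archiving. This is a sketch only: the marker strings are assumptions based on the usual Google Analytics and Piwik snippets, which reference `google-analytics.com` and `piwik.js` or `piwik.php`, and `tracker_report` is a hypothetical helper name.

```python
# Substrings assumed to identify each tracker's standard snippet.
GA_MARKERS = ("google-analytics.com",)
PIWIK_MARKERS = ("piwik.js", "piwik.php")


def tracker_report(html_text):
    """Report which tracker markers appear in html_text."""
    text = html_text.lower()
    return {
        "google_analytics": any(marker in text for marker in GA_MARKERS),
        "piwik": any(marker in text for marker in PIWIK_MARKERS),
    }
```

For an archive that should carry Piwik stats only, every page would ideally report `google_analytics` as false and `piwik` as true.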