This is a simple web client that exposes web archive holdings via the Memento protocol.
Given a URL, the client uses one or more known TimeGates to look up all known archival holdings, then summarizes them and provides links.
To run it, use
play run
or, if you are behind a proxy, specify the proxy server as follows:
play run -Dhttp.proxyHost=my.proxy.host -Dhttp.proxyPort=3127
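The TimeGate lookup described at the top can be sketched roughly as follows. This is a minimal Python sketch, not the client's own code. It assumes that the TimeGate takes the target URL appended to its base URL, that the desired time is sent in an `Accept-Datetime` header, and that the response `Link` header uses quoted parameter values. The function names `build_timegate_request` and `parse_mementos` are hypothetical.

```python
import re
from datetime import datetime, timezone
from email.utils import format_datetime, parsedate_to_datetime

# Matches one entry of a Link header: <uri> followed by ;name="value" pairs.
# Assumption: parameter values are always double-quoted.
LINK_RE = re.compile(r'<([^>]*)>((?:\s*;\s*[\w-]+="[^"]*")*)')
PARAM_RE = re.compile(r'([\w-]+)="([^"]*)"')


def build_timegate_request(timegate, target_url, when):
    """Return (url, headers) for a TimeGate lookup; nothing is sent here."""
    # Assumption: the TimeGate accepts the target URL appended to its base URL.
    url = timegate.rstrip("/") + "/" + target_url
    # Accept-Datetime carries an HTTP-date, so convert to UTC and format as GMT.
    http_date = format_datetime(when.astimezone(timezone.utc), usegmt=True)
    return url, {"Accept-Datetime": http_date}


def parse_mementos(link_header):
    """Extract (datetime, uri) pairs for every memento in a Link header."""
    mementos = []
    for uri, raw_params in LINK_RE.findall(link_header):
        params = dict(PARAM_RE.findall(raw_params))
        # rel may hold several space-separated values, e.g. "first memento".
        if "memento" in params.get("rel", "").split() and "datetime" in params:
            mementos.append((parsedate_to_datetime(params["datetime"]), uri))
    return sorted(mementos)
```

The returned URL and headers can be passed to any HTTP client, for example `urllib.request.Request(url, headers=headers)`, and the `Link` header of the response handed to `parse_mementos`.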
- Make the urlender script more robust so that it times itself out?
- Add busy logos/warning while screenshots are loaded?
- Add text that displays if there is no JavaScript?
- No LIVE?
- Fix caching so that pre-filtered results are cached instead?
- Use the Future pattern for long calls to avoid AskTimeoutException (seen on graphs failing to load. JS?).
- Allow years to be selected, in turn filtering the list of Mementos to show a year-long graph etc.
- Add a lots-of-screenshots timeline.
- Use timeline screenshots to estimate the degree of change over time (e.g. even just colour changes).
- Make screenshots cope better with non-HTML content?
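As an illustration of the year-selection item in the list above, here is a minimal Python sketch. It assumes Mementos are held as `(datetime, uri)` pairs, which is an assumption about the data shape rather than how the client stores them, and the function names are hypothetical.

```python
from collections import Counter
from datetime import datetime


def filter_by_year(mementos, year):
    """Keep only the (datetime, uri) pairs captured in the given year."""
    return [(when, uri) for when, uri in mementos if when.year == year]


def monthly_counts(mementos, year):
    """Return 12 capture counts, January first, for a year-long graph."""
    counts = Counter(when.month for when, _ in filter_by_year(mementos, year))
    return [counts.get(month, 0) for month in range(1, 13)]
```

The twelve counts could feed a per-month bar chart for the selected year, while `filter_by_year` gives the matching list of links.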
One minor issue is that the lanl.gov aggregate TimeGate does not appear to aggregate everything. Individual Wikipedia pages, for example, are not coming up, despite the fact that they are in the UK Web Archive and are indeed reported by our TimeGate. They are also clearly stored elsewhere, e.g. http://wayback.archive.org/web/*/http://en.wikipedia.org/wiki/Wikipedia:Wikipedia_is_an_encyclopedia
Leads via a redirect to the timebundle, which is the RDF form, so we need an RDF parser to extract the instances.
It appears the Aggregator is still using a proxy, so we should move Memento to production.
http://mementoweb.org/depot/proxy/BL/
Error: 404 Unknown time map serialization
Note that
Works fine, so I wonder if the error on the BL proxy is throwing the whole aggregation off?
http://mementoproxy.lanl.gov/aggr/timegate/
http://mementoproxy.lanl.gov/google/timegate/
http://www.webarchive.org.uk/wayback/memento/timegate/
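Because the aggregate TimeGate seems to miss some holdings, one way to check is to query each TimeGate listed above separately and compare the results locally. The Python sketch below assumes the per-TimeGate results are already available as lists of `(datetime, uri)` pairs. The function names and the dictionary layout are hypothetical.

```python
from datetime import datetime

# The TimeGates listed above.
TIMEGATES = [
    "http://mementoproxy.lanl.gov/aggr/timegate/",
    "http://mementoproxy.lanl.gov/google/timegate/",
    "http://www.webarchive.org.uk/wayback/memento/timegate/",
]


def merge_holdings(results_by_timegate):
    """Merge per-TimeGate memento lists, recording who reported each one.

    Returns a sorted list of ((datetime, uri), [timegates]) entries, so a
    holding reported by several TimeGates appears only once.
    """
    merged = {}
    for timegate, mementos in results_by_timegate.items():
        for memento in mementos:
            merged.setdefault(memento, []).append(timegate)
    return sorted(merged.items())


def missing_from_aggregate(results_by_timegate, aggregate):
    """List holdings some other TimeGate reports but the aggregate does not."""
    seen_by_aggregate = set(results_by_timegate.get(aggregate, []))
    seen_elsewhere = {
        memento
        for timegate, mementos in results_by_timegate.items()
        if timegate != aggregate
        for memento in mementos
    }
    return sorted(seen_elsewhere - seen_by_aggregate)
```

A non-empty result from `missing_from_aggregate` for the lanl.gov aggregate would confirm that it is not picking up everything the individual TimeGates report.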
See also http://mementoweb.org/depot/
http://www.slideshare.net/hvdsomp/memento-updated-technical-details-february-2010
See also the JISC Repositories mailing list discussion: https://www.jiscmail.ac.uk/cgi-bin/webadmin?A2=ind1002&L=JISC-REPOSITORIES&T=0&F=&S=&P=10604
http://arxiv.org/abs/1003.3661
http://lists.w3.org/Archives/Public/public-lod/2010Mar/0169.html