Web Magazine for Information Professionals

WebWatch: Surfing Historical UK University Web Sites

Brian Kelly outlines strategies for choosing appropriate standards for building Web sites.

It has been said that those who ignore history, are condemned to repeat it. In the Web world we can be so excited by new developments that we may forget approaches we have taken in the past and fail to learn from our mistakes. This article describes how the WayBack Machine [1] was used to look at the history of UK University Web sites.

The Survey

The survey was carried out by entering the URL of the entry point for UK University Web sites, recording details of the availability of the Web site in the Internet Archive (including earliest and most recent dates and numbers of entries) and providing a link to enable readers of this article to obtain the most recent results.

The full survey findings are given in the Appendix.

Viewing The Past

A "Web Tour" has been developed which provides an automated view of the oldest pages. The display changes every 10 seconds. The Web tour is available at [2].

Discussion

Brief comments are provided on interesting findings in the Appendix. It was noted, for example, that in a number of cases images were not displayed. This will have been due to the site containing a robots.txt file which banned robots from accessing directories containing images for the site, such as the institution's logo, standard navigation images, etc. When the Standard For Robot Exclusion [3] was first released it was applicable for indexing robots such as the robot used by the AltaVista search engine. The consensus view was that it was advisable to ban robots from accessing directories containing images, as these contained no text to be indexed. We can now see that robots provide a wide range of functions which include archiving. There are advantages to be gained in allowing robots to access directories of images, not only so that the Internet Archive's robot can archive a Web site's images, but also to allow indexing robots which can provide a search facility for images (such as the Google image search [4]).

A number of Web sites contained "Best viewed in Netscape 2" style messages. The community probably now accepts that designing best sites for a specific browser is not advisable and it is probably sensible to avoid promoting a specific browser on an institution's entry point.

There appeared to be a number of instances of use of non-native Web technologies such as Java which required browser plugins on institutional home pages.

There were also a number of instances of use of frames or content provided by third parties, (such as usage counters), which were not fully functional in the archived Web site.

Conclusions

The Internet Archive can be a fun resource to access, especially for Web managers who have been developing Web sites for a number of years. However in addition to the embarrassment factor, ("did I really use the <BLINK> tag!"), there are also valuable lessons which can be learnt from our approaches to Web site development in the late 1990s. It can help us to reflect on the processes which lead to the chosen design; the technical decisions which where used and the way in which we sought to support the users of our Web sites.

This will, of course, be a continuous process. The decisions we are making today are the potential embarrassment of tomorrow. Perhaps the most importance message of this survey is the need to preserve access to to our digital past. The Internet Archive may provide one mechanism for preserving the past (provided, of course, that we allow robots to access our resources). Some organisations, however, may not feel comfortable in relying on a third party. In this case there will be a need to develop a digital preservation or records management approach which embraces institutional Web sites.

References

  1. Wayback Machine, Internet Archive
    http://www.archive.org/
  2. Display of UK University Entry Points From Internet Archive, UKOLN
    http://www.ukoln.ac.uk/web-focus/site-rolling-demos/universities-archive/
  3. A Standard for Robot Exclusion, The Web Robot Pages,
    http://www.robotstxt.org/wc/norobots.html
  4. Google Image Search, Google,
    http://images.google.com/


Appendix 1 - Survey

A summary of the findings is given in the following table.

Table 1: Analysis of UK University Web Sites in Internet Archive
 InstitutionEarliest findingCommentsView ArchiveView Earliest Entry
1AberdeenDec 12, 1997 [View][View Earliest Entry]
2Abertay DundeeSep 03, 1999Images not displayed.[View][View Earliest Entry]
3AberystwythDec 10, 1997Images not displayed.[View][View Earliest Entry]
4Anglia Polytechnic UniversityMar 30, 1997Images not displayed.[View][View Earliest Entry]
5AstonJan 23, 1997 [View][View Earliest Entry]
6BangorFeb 20, 1997 [View][View Earliest Entry]
7Bath SpaDec 02, 1998Blank page displayed.[View][View Earliest Entry]
8BathApr 18, 1997 [View][View Earliest Entry]
9Queen's University of BelfastDec 10, 1997 [View][View Earliest Entry]
10Bell CollegeDec 10, 1997 [View][View Earliest Entry]
11Birkbeck CollegeDec 01, 1998Sidebar not available due to robots.txt file.[View][View Earliest Entry]
12BirminghamJan 06, 1997 [View][View Earliest Entry]
13Bishop Grosseteste CollegeNov 11, 1998 [View][View Earliest Entry]
14Bolton InstituteDec 01, 1998 [View][View Earliest Entry]
15Arts Institute at BournemouthOct 09, 1999"Splash screen" causes problems.[View][View Earliest Entry]
16BournemouthOct 20, 1996 [View][View Earliest Entry]
17BradfordApr 30, 1997 [View][View Earliest Entry]
18BrightonJul 15, 1997 [View][View Earliest Entry]
19BristolJun 06, 1997Has links to Extranet and Intranet.[View][View Earliest Entry]
20BrunelDec 10, 1997 [View][View Earliest Entry]
21Buckinghamshire ChilternsNov 25, 1999Creates pop-up window and uses frames and JavaScripted navigation.[View][View Earliest Entry]
22CambridgeFeb 12, 1997 [View][View Earliest Entry]
23Institute of Cancer ResearchFeb 27, 1997 [View][View Earliest Entry]
24Canterbury Christ ChurchFeb 17, 1997 [View][View Earliest Entry]
25CardiffDec 10, 1997 [View][View Earliest Entry]
26University of Wales Institute, CardiffDec 06, 1998Has Java applet which does not work. Images not displayed.[View][View Earliest Entry]
27University of Central EnglandDec 18, 1996 [View][View Earliest Entry]
28University of Central LancashireDec 12, 1997Blank screen.[View][View Earliest Entry]
29Central School of Speech and DramaOct 11, 1999 [View][View Earliest Entry]
30Chester CollegeDec 10, 1997 [View][View Earliest Entry]
31University College ChichesterAug 19, 2000 [View][View Earliest Entry]
32City UniversityFeb 07, 1997 [View][View Earliest Entry]
33Courtauld Institute of ArtDec 06, 1998 [View][View Earliest Entry]
34CoventryDec 11, 1997 [View][View Earliest Entry]
35Cranfield Not archived due to site's robots.txt file.[View][View Earliest Entry]
36Dartington CollegeJul 06, 1997 [View][View Earliest Entry]
37De MontfortJan 05, 1997 [View][View Earliest Entry]
38DerbyJul 03, 1997 [View][View Earliest Entry]
39DundeeJun 03, 1997Large numbers of links.[View][View Earliest Entry]
40DurhamJun 07, 1997Repeated use of logo as watermark.[View][View Earliest Entry]
41East AngliaDec 10, 1997Error in accessing page in Archive.[View][View Earliest Entry]
42University of East LondonJun 26, 1997 [View][View Earliest Entry]
43Edge Hill CollegeJan 25, 1997 [View][View Earliest Entry]
44Edinburgh College of ArtApr 22, 1997 [View][View Earliest Entry]
45EdinburghJan 04, 1997 [View][View Earliest Entry]
46EssexDec 10, 1997 [View][View Earliest Entry]
47ExeterMar 30, 1997 [View][View Earliest Entry]
48Falmouth College-Note archived due to site's robots.txt file.[View][View Earliest Entry]
49GlamorganNov 11, 1998Comment about "Flash 3 plugin".[View][View Earliest Entry]
50Glasgow CaledonianJan 02, 1997Contains link to home page.[View][View Earliest Entry]
51Glasgow School of ArtMay 08, 1997Contains "best viewed with Netscape Navigator 2.0 or higher" message.[View][View Earliest Entry]
52GlasgowFeb 06, 1997 [View][View Earliest Entry]
53GloucestershireJan 22, 1997 [View][View Earliest Entry]
54Goldsmiths CollegeJan 27, 1998 [View][View Earliest Entry]
55GreenwichDec 11, 1997 [View][View Earliest Entry]
56Harper AdamsApr 11, 2000Uses Java applet (not found).[View][View Earliest Entry]
57Heriot-WattJan 14, 1997 [View][View Earliest Entry]
58HertfordshireJul 14, 1997 [View][View Earliest Entry]
59HuddersfieldJun 26, 1997 [View][View Earliest Entry]
60HullFeb 06, 1997 [View][View Earliest Entry]
61Imperial CollegeNov 06, 1996 [View][View Earliest Entry]
62Institute of EducationMay 23, 1997 [View][View Earliest Entry]
63KeeleApr 16, 1997 [View][View Earliest Entry]
64Kent Institute of Art and DesignNov 08, 1996 [View][View Earliest Entry]
65KentOct 14, 1997 [View][View Earliest Entry]
66King Alfred's CollegeDec 02, 1998Framed interface, but pages not in archive.[View][View Earliest Entry]
67King's College LondonJul 07, 1997 [View][View Earliest Entry]
68KingstonDec 10, 1997 [View][View Earliest Entry]
69LampeterFeb 19, 1997 [View][View Earliest Entry]
70LancasterOct 21, 1997 [View][View Earliest Entry]
71Leeds Metropolitan UniversityJul 22, 1997 [View][View Earliest Entry]
72LeedsOct 19, 1996File Retrieve Error.[View][View Earliest Entry]
73LeicesterJun 13, 1997 [View][View Earliest Entry]
74LincolnJan 25, 1998 [View][View Earliest Entry]
75Liverpool HopeApr 14, 1997 [View][View Earliest Entry]
76Liverpool John Moores UniversityApr 29, 1997 [View][View Earliest Entry]
77LiverpoolFeb 17, 1997 [View][View Earliest Entry]
78London Business SchoolJun 30, 1997Contains "You will require the following:
Netscape Navigator 2.1 or higher
Internet Explorer 3.0 or higher" message.
[View][View Earliest Entry]
79London Guildhall UniversityDec 11, 1997Images not displayed.[View][View Earliest Entry]
80London Institute Not available due to robots.txt file.[View][View Earliest Entry]
81University of LondonJun 10, 1998 [View][View Earliest Entry]
82London School of EconomicsDec 31, 1996 [View][View Earliest Entry]
83London School of Hygiene & Tropical MedicineFeb 06, 1997 [View][View Earliest Entry]
84LoughboroughNov 09, 1996 [View][View Earliest Entry]
85LutonDec 11, 1997Has "Best viewed with Netscape" message.
Navigational images not displayed.
[View][View Earliest Entry]
86UMISTApr 05, 1997 [View][View Earliest Entry]
87Manchester Metropolitan UniversityJul 09, 1997 [View][View Earliest Entry]
88ManchesterDec 11, 1997 [View][View Earliest Entry]
89University of Wales College of MedicineApr 13, 1997 [View][View Earliest Entry]
90MiddlesexApr 24, 1997Page not available in index.[View][View Earliest Entry]
91NapierDec 24, 1996 [View][View Earliest Entry]
92NewcastleFeb 15, 1997 [View][View Earliest Entry]
93Newman CollegeApr 10, 1997 [View][View Earliest Entry]
94NewportDec 21, 1996 [View][View Earliest Entry]
95North-East Wales Institute of Higher EducationNov 12, 1996 [View][View Earliest Entry]
96University of North LondonMay 23, 1997 [View][View Earliest Entry]
97University College NorthamptonFeb 27, 1997 [View][View Earliest Entry]
98Northern School of Contemporary DanceMay 07, 1997Framed interface. Usage counter not operational.[View][View Earliest Entry]
99University of NorthumbriaFeb 06, 1998 [View][View Earliest Entry]
100Norwich School of Art and DesignApr 12, 1997 [View][View Earliest Entry]
101Nottingham Trent UniversityJul 24, 1997 [View][View Earliest Entry]
102NottinghamDec 10, 1997 [View][View Earliest Entry]
103Oxford BrookesFeb 12, 1997 [View][View Earliest Entry]
104OxfordDec 11, 1997 [View][View Earliest Entry]
105PaisleyDec 11, 1997 [View][View Earliest Entry]
106PlymouthJan 13, 1998 [View][View Earliest Entry]
107PortsmouthFeb 21, 1997 [View][View Earliest Entry]
108Queen Margaret University College,Dec 24, 1996 [View][View Earliest Entry]
109Queen Mary and Westfield CollegeApr 23, 2001 [View][View Earliest Entry]
110Ravensbourne CollegeDec 31, 1996 [View][View Earliest Entry]
111ReadingJan 03, 1997 [View][View Earliest Entry]
112University of Wales, RegistryDec 05, 1998 [View][View Earliest Entry]
113Robert Gordon UniversityJul 02, 1997 [View][View Earliest Entry]
114University of Surrey, RoehamptonJun 03, 1997 [View][View Earliest Entry]
115Rose Bruford CollegeDec 05, 1998 [View][View Earliest Entry]
116Royal Academy of MusicDec 12, 1998Grey box on blue background.[View][View Earliest Entry]
117Royal Agricultural CollegeApr 05, 1997 [View][View Earliest Entry]
118Royal College of ArtJan 20, 1997Java applets not found.[View][View Earliest Entry]
119Royal College of MusicDec 19, 1996 [View][View Earliest Entry]
120Royal HollowayAug 17, 2000Displays simple text file, with message about page for modern browsers which support JavaScript.[View][View Earliest Entry]
121Royal Northern College of MusicJan 25, 1998 [View][View Earliest Entry]
122Royal Scottish Academy of Music and DramaOct 12, 1997 [View][View Earliest Entry]
123Royal Veterinary CollegeJun 23, 1998 [View][View Earliest Entry]
124St AndrewsJan 07, 1997 [View][View Earliest Entry]
125St George's Hospital Medical SchoolOct 22, 1997 [View][View Earliest Entry]
126College of St Mark and St JohnDec 24, 1997 [View][View Earliest Entry]
127St Martin's CollegeDec 28, 1996 [View][View Earliest Entry]
128St Mary's CollegeJan 17, 1999 [View][View Earliest Entry]
129SalfordJan 29, 1997 [View][View Earliest Entry]
130School of Oriental and African StudiesJan 21, 1997 [View][View Earliest Entry]
131School of PharmacyDec 06, 1998 [View][View Earliest Entry]
132Scottish Agricultural CollegeOct 07, 1997Contains large grey image, provided by Internet Archive.[View][View Earliest Entry]
133Sheffield HallamOct 11, 1997Page timed out - uses JavaScript to detect Netscape version 2.[View][View Earliest Entry]
134SheffieldNov 11, 1998Path Index Error.[View][View Earliest Entry]
135South Bank UniversityJul 19, 1997 [View][View Earliest Entry]
136Southampton InstituteDec 21, 1997Has "recommended that these pages be viewed using Netscape 2.0 or Internet Explorer 3.0" message.[View][View Earliest Entry]
137SouthamptonDec 29, 1996 [View][View Earliest Entry]
138StaffordshireFeb 21, 1997 [View][View Earliest Entry]
139StirlingFeb 28, 1997 [View][View Earliest Entry]
140StrathclydeJun 05, 1997 [View][View Earliest Entry]
141SunderlandDec 11, 1997 [View][View Earliest Entry]
142Surrey Institute of Art and DesignDec 11, 1997 [View][View Earliest Entry]
143SurreyMay 03, 1997 [View][View Earliest Entry]
144SussexDec 11, 1997 [View][View Earliest Entry]
145Swansea InstituteJan 10, 1997 [View][View Earliest Entry]
146University of Wales, SwanseaDec 23, 1996 [View][View Earliest Entry]
147TeessideJul 09, 1997 [View][View Earliest Entry]
148Thames Valley UniversityDec 22, 1996 [View][View Earliest Entry]
149Open UniversityFeb 01, 1997 [View][View Earliest Entry]
150Trinity College of Music-No matches found.[View][View Earliest Entry]
151Trinity College, CarmarthenJan 22, 1998No text displayed.[View][View Earliest Entry]
152Trinity and All Saints CollegeNov 29, 1996 [View][View Earliest Entry]
153UlsterFeb 12, 1997Path Index Error.[View][View Earliest Entry]
154University College LondonDec 10, 1997Path Index Error.[View][View Earliest Entry]
155WarwickDec 10, 1997Images not displayed.[View][View Earliest Entry]
156Royal Welsh College of Music and Drama-No matches found.[View][View Earliest Entry]
157University of the West of EnglandJan 03, 1997 [View][View Earliest Entry]
158WestminsterDec 11, 1997 [View][View Earliest Entry]
159WolverhamptonFeb 04, 1997Path Index Error.[View][View Earliest Entry]
160University College WorcesterMay 26, 1997 [View][View Earliest Entry]
161Writtle CollegeMar 29, 1997 [View][View Earliest Entry]
162York St John CollegeAug 03, 2001Path Index Error.[View][View Earliest Entry]
163YorkJan 08, 1997 [View][View Earliest Entry]

The information in the table was initially collected between 20-23 December 2002

Author Details

Picture of Brian Kelly Brian Kelly
UK Web Focus
UKOLN
University of Bath
Bath
BA2 7AY

Email: b.kelly@ukoln.ac.uk

Brian Kelly is UK Web Focus. He works for UKOLN, which is based at the University of Bath