Department of Software Technology
Vienna University of Technology
The AOLA Austria On-Line Archive Project - Related Projects
Numerous other projects address issues related to building web archives. This page provides an overview of the most important initiatives related to the AOLA Austria On-Line Archive.
- KulturarW3 - http://kulturarw3.kb.se/html/kulturarw3.eng.html
The Royal Library (Kungl. biblioteket, abbreviated KB), National Library of Sweden, tasked since 1661 with collecting all Swedish printed publications, has
inaugurated a project, entitled Kulturarw³ (The Swedish Archiw³e), with a view to the long-term preservation of electronic documents.The aim of this project is to'test methods of collecting, preserving and providing access to Swedish electronic documents which are accessible on line in such a way that they can be regarded as published. Through this project KB is also laying the foundations of a collection of Swedish electronic publishing for our time and for coming generations.
Kulturaw³ uses NWA-Combine, a modified version of the Combine harvester, optimized for archiving purposes.
- Combine - http://www.lub.lu.se/combine/
Combine is an open system for harvesting and threshing (indexing) Internet resources. The name is derived from the combine-harvester since the two perform their jobs in a similar way. The Combine was initially developed as a part of the Development of a European Service for Information on Research and Education (DESIRE) project, which was funded by the European Commission within Telematics for Science Program.
The Combine Robot was released in spring '98, and is currently used whithin the nordic NWI services in Sweden and Denmark. It is also used in other contexts like the research information system Safari.
- NEDLIB - http://www.kb.nl/coop/nedlib/
NEDLIB is a collaborative project of European national libraries. It aims to construct the basic infrastructure upon which a networked European deposit library can be built. The objectives of Nedlib concur with the mission of national deposit libraries to ensure that electronic publications of the present can be used now and in the future. Within the project, a special NEDLIB Harvester was developed, optimized for archiving, rather than indexing tasks.
- Nordic Web Archive - http://nwa.nb.no
The Nordic Web Archive (NWA) is a cooperation between the Nordic National Libraries.
NWA started in 1997 as a forum for co-ordination and exchange of experience in the fields of harvesting and archiving web documents. The cooperation culminated in an application to Nordunet2 of funds to develop the NWA Access Module, a common tool for accessing the archived web documents.
The development of the NWA Access Module is organised as a project with a steering group at the top populated by the National Librarians. Beneath this steering group is yet another steering group populated by technical specialists from each of the Nordic Libraries.
The NWA Access Module project was started in november 2000 with the appointment of a project manager. At the moment the project evaluates possible search engine vendors.
- Internet Archive - http://www.archive.org/
The Internet Archive is a 501(c)(3) public nonprofit that was founded to build an `Internet library,' with the purpose of offering free access to historical digital collections for researchers, historians, and scholars. Founded in 1996 and located in the Presidio of San Francisco, the Archive has been receiving data donations from Alexa Internet and others. In late 1999, the organization started to grow to build more well-rounded collections. The Internet archive uses the Alexa indexer to perform its crawls.
- EVA - http://www.lib.helsinki.fi/eva/english.html
EVA is a joint project by libraries, publishers and expert organizations, being part of the strategy program Education, Training and Research in the Information Society by the Finnish Ministry of Education. The central aim of the project is to create methods and tools to collect, register and archive electronic publications distributed on the Internet and to investigate conditions for long-term preservation of them in libraries. EVA co-operates closely with other projects managed by Helsinki University Library (projects such as ELEKTRA and MUISTI), as well as projects involving dissertations in Oulu and Jyväskylä University Libraries, all of which produce electronic networked publications.
- Pandora - http://www.nla.gov.au/policy/plan/pandora.html
With the increasing amount of material being produced by online electronic publishing, issues concerning the long-term
access and permanent archiving of such materials are creating great concern in academic and research communities
throughout the world. In Australia, many of the parties with a stake in access to digital publications are attempting to
understand the issues involved and to develop strategies to deal with them.
PANDORA is a project initiated by the National Library of Australia to investigate strategies for the storage, preservation
and access to digital data in the context of the creation of an electronic archive of library materials. This study is being
funded in part by the Australian Vice-Chancellors' Committee (AV-CC) Working Group on Electronic Publishing.
- CAMiLEON - http://www.si.umich.edu/CAMILEON/
Creative Archiving at Michigan and Leeds: Emulating the Old on the New.
The CAMiLEON Project will evaluate emulation as a digital preservation strategy by developing emulation tools, cost-benefit analysis and user evaluation. The project is a joint undertaking between the universities of Michigan (USA) and Leeds (UK) and is funded by JISC and NSF.
- CEDARS - http://www.leeds.ac.uk/cedars/
The Cedars Project is a Higher Education initiative funded by the Joint Information Systems Committee. It officially began on the 1st April 1998. The funding was awarded to the Consortium of University Research Libraries (CURL) and the work is carried out on behalf of CURL by three CURL institutions - Leeds, Oxford and Cambridge.
Cedars stands for "CURL exemplars in digital archives" and the main objective of the project is to address strategic, methodological and practical issues and provide guidance in best practice for digital preservation. It will do this by work on two levels - through practical demonstrator projects which will provide concrete practical experience in preserving digital resources and through strategic working groups based on broad concepts or concerns which will articulate preferences and make recommendations of benefit to the wider community. The main deliverables of the project will be recommendations and guidelines as well as practical robust and scaleable models for establishing distributed digital archives.
- Prism - http://prism.cornell.edu/PrismWeb/
Project Prism at Cornell University is a four-year effort to investigate and develop the policies and mechanisms needed for information integrity in digital libraries. The project focuses on the following areas: * Preservation - long term survivability of information in digital form.* Reliability - predictable availability of information resources and services. * Interoperability - open standards that allow widest sharing of information among providers and users. * Security - attention to both the privacy rights of users of information and the intellectual property rights of content creators. * Metadata -structured information that makes it possible to ensure information integrity in digital libraries.
- LOCKSS - http://lockss.stanford.edu/
LOCKSS (Lots Of Copies Keep Stuff Safe) is a system prototype to preserve access to scientific journals published on the Web. LOCKSS models how libraries provide access to paper content by allowing individual libraries to safeguard their communities access to Web content. The system ensures that hyperlinks continue to resolve and appropriate content is delivered, even when in the Internet the links don?t work and content is no longer available. Libraries running LOCKSS cooperate to detect and repair preservation failures. The project is funded by the Mellon Foundation, the National Science Foundation, Sun Microsystems, Inc., and Stanford University.
- Arches - http://www.rlg.org/strat/projarch.html
Arches-Archival Server and Test Bed: The name "Arches" is a concatenation of the term archival server, suggesting the bridges between various forms of metadata and the whole information objects they describe. It also is intended to reflect a number of doorways into the research resources made available by RLG.
In 1996, RLG launched a variety of efforts to build the conceptual and physical infrastructure to contain information in digital form, provide tools for meaningful access, and ensure long-term availability to the information and the means to access it. The Arches project focused on an online repository for digital resources of all types and the software environment that makes this repository internationally accessible and responsibly maintainable.
- InterPARES - http://www.interpares.org/
The InterPARES Project is a major international research initiative in which archival scholars, computer engineering scholars, national archival institutions and private industry representatives are collaborating to develop the theoretical and methodological knowledge required for the permanent preservation of authentic records created in electronic systems.
- Victorian Electronic Records Strategy - http://www.prov.vic.gov.au/vers/
The Victorian Electronic Records Strategy has been developed by the Victorian government to preserve the electronic records of the state for the long term. The Strategy is auspiced by Public Record Office Victoria (PROV). PROV is the state's archival authority and has the function of managing Victorian government records (via standard setting and provision of archival services) and preserving the state's heritage.
- National Library of Canada Electronic Collection - http://collection.nlc-bnc.ca/e-coll-e/index-e.htm
The Electronic Collection of the National Library of Canada (NLC) consists of Canadian books and periodicals
published online. It includes more than 3,000 titles published by both the commercial publishing sector and the
government publishing sector. The archived publications exist in various formats, including HTML, ASCII and
others, i.e., the format in which they were produced. In some cases, certain software such as Acrobat Reader
http://www.adobe.com, RealAudio http://www.realaudio.com, Ghostview
http://www.cs.wisc.edu/~ghost/index.html or others are required to read publication content.
Up to the AOLA Homepage