next up previous
Next: Conclusions Up: Creating an Order in Previous: An experimental document archive

   
Experimental results

In order to simulate a distributed digital library, the documents of the CIA Worldfactbook have been split randomly into five parts consisting of 50 documents each. Note that five documents are thus assigned to two parts at a time. Each part of the library was then used for training of a separate $7 \times 7$ self-organizing map. Each of these maps represents a topologically ordered portion of its subset of the document library. Due to space restrictions we can only show one of these maps in this paper, see Figure 2. The other four maps, however, are rather similar in spirit.


  
Figure 2: Low level self-organizing map
\begin{figure}\begin{center}
\leavevmode
\epsfxsize=70mm
\epsffile{lowmap.eps}
\end{center}\end{figure}

In a second step, we integrated the five independent maps into one single $7 \times 7$ self-organizing map which now represents the complete document library. The map that results from this integration is shown in Figure 3. Note that the main clusters are clearly visible from the map representation, with the information from the low level maps being arranged according to the organizational principles of the high level map. As examples consider the area containing African countries in the left upper part of the map, the area of oil producing countries, the area of countries from Latin America, or the area of countries usually referred to as the first world in the right upper part. With the latter area, please note the explicit distinction into countries belonging to the Western hemisphere and those belonging to the Eastern hemisphere1. Finally, it is worth noting that the countries that are contained twice in the library are pairwise assigned to the same unit in the integrated map. These countries are Austria, Comoros, Iceland, Japan, and Mozambique.


  
Figure 3: Integration of the low level self-organizing maps
\begin{figure}\begin{center}
\leavevmode
\epsfxsize=95mm
\epsffile{topmap2.eps}
\end{center}\end{figure}

In a nutshell, the higher level map forms an orderly mapping based on the information contained in several low level maps, each of which representing a portion of the library. The higher level map thus is a convenient starting point for a user trying to find her orientation in a distributed digital library.


next up previous
Next: Conclusions Up: Creating an Order in Previous: An experimental document archive
Andreas RAUBER
1998-09-10