Institut für Softwaretechnik
Favoritenstr. 9 - 11 / 188
A - 1040 Wien
Tel.: (+43) 1 58801 18801, Fax.: (+43) 1 5040532

Working Group: Information & Software Engineering

SOMLib Data Files

Technical Report TR-IR98-1

Ver. 1.3.5 - 18. 07. 2000 (internal)


SOMLib Data Files - General Information

There are 5 different types of data files, which are used to create 6 different files, namely:
  1. SOMLib Map Description File
  2. SOMLib Weight Vector File
  3. SOMLib Template Vector File
  4. SOMLib Unit Description File
  5. SOMLib Vector Description File
All of these files are built around the same basic structure, which is defined as follows:
The following sections describe the 6 files in more detail, giving the contents and purpose of each file as well as its structure in terms of the order of parameters and the distinction between mandatory (M) and optional (O) parameters. The relationships between the parameters are also listed.

SOMLib Map Description File

Standard filename: XXX.map
Produced by: SOM training program
Modified by: SOM mapping program, SOM quant-error program
Demo-File: demo.mapdescr

This file describes the basic structure of the Self-Organizing Map, giving all the parameters used in the training process. It is initially written as a result of the training process of the SOM. Additional information attributes may be added as required by various programs.

Parameter Entries:


SOMLib Weight Vector File

Standard filename: XXX.wgt
Produced by: SOM init program, SOM training program
Modified by: -
Demo-File: demo.wgt

This file describes the weight vectors of the trained Self-Organizing Map. It is initially written by the SOM init program, read by the SOM training program as the initialized map, and finally written again by the SOM training program after training.
The file consists of two blocks: the first describes the general SOM structure, the second gives the weight vectors of the SOM.
The first 4 parameter entries serve as a sanity check to find out whether the given SOM map file and weight vector file match. If any of the first 4 parameters does not match, the program should print a detailed error message and exit.
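The sanity check described above could be sketched as follows. This is a minimal illustration under stated assumptions, not the SOMLib implementation: it assumes header entries are written one per line as `$NAME value`, and that the first 4 entries of the weight vector file are looked up by name in the map description file. The function names and the parameter names in the usage example (`$TYPE`, `$XDIM`, `$YDIM`, `$VEC_DIM`) are hypothetical placeholders.

```python
import sys


def read_header(lines):
    """Collect the leading `$NAME value` entries, preserving their order.

    Assumption: header entries start with '$', and the header ends at the
    first non-empty line that does not.
    """
    entries = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        if not line.startswith("$"):
            break
        parts = line.split(None, 1)
        entries.append((parts[0], parts[1].strip() if len(parts) > 1 else ""))
    return entries


def find_mismatches(map_lines, wgt_lines, n_check=4):
    """Compare the first n_check weight vector header entries against the
    entries of the same name in the map description header."""
    map_header = dict(read_header(map_lines))
    mismatches = []
    for name, wgt_value in read_header(wgt_lines)[:n_check]:
        map_value = map_header.get(name)  # None if the map file lacks it
        if map_value != wgt_value:
            mismatches.append((name, map_value, wgt_value))
    return mismatches


def check_or_exit(map_lines, wgt_lines):
    """Print one detailed message per mismatch, then exit with an error."""
    mismatches = find_mismatches(map_lines, wgt_lines)
    for name, map_value, wgt_value in mismatches:
        print(f"Mismatch in {name}: map file has {map_value!r}, "
              f"weight vector file has {wgt_value!r}", file=sys.stderr)
    if mismatches:
        sys.exit(1)
```

A caller would read both files into lists of lines and pass them to `check_or_exit` before touching the vector block, for example `check_or_exit(open("XXX.map").readlines(), open("XXX.wgt").readlines())`.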

Parameter Entries:


SOMLib Quantization Error Map File

Standard filename: XXX.err
Produced by: SOM quantization error program
Modified by: -
Demo-File: none

This file describes the quantization error vectors of the trained Self-Organizing Map. It is written by the SOM quantization error program based on a trained map and given input vectors.
The file consists of two blocks: the first describes the general SOM structure, the second gives the quantization error vectors of the SOM.
The file structure is identical to that of the weight vector file. The first 4 parameter entries serve as a sanity check to find out whether the given SOM map file and weight vector file match. If any of the first 4 parameters does not match, the program should print a detailed error message and exit.
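Since the two-block layout is shared with the weight vector file, a single reader could serve both file kinds. The sketch below is an illustration only and rests on assumptions not confirmed by this report: header lines start with `$`, every other non-empty line holds one vector as whitespace-separated numbers, and a trailing non-numeric token, if present, is treated as a label. The function name `split_blocks` is hypothetical.

```python
def split_blocks(lines):
    """Split a file into its header block and its vector block.

    Returns (header, vectors): header maps each `$NAME` to its value as a
    string; vectors is a list of (components, label) tuples, where label is
    None when the line ends in a number.
    """
    header = {}
    vectors = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        if line.startswith("$"):
            # Assumed header syntax: `$NAME value`.
            parts = line.split(None, 1)
            header[parts[0]] = parts[1].strip() if len(parts) > 1 else ""
            continue
        tokens = line.split()
        label = None
        try:
            float(tokens[-1])
        except ValueError:
            # Assumed: a trailing non-numeric token is a label.
            label = tokens.pop()
        vectors.append(([float(t) for t in tokens], label))
    return header, vectors
```

Keeping the header values as strings leaves their interpretation to the caller, which matters here because the parameter semantics differ between the file kinds that share this layout.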

Parameter Entries:


SOMLib Input Vector File

Standard filename: XXX.in
Produced by: Parser, Vector Generator
Modified by: -
Demo-File: demo.tfxidf

This file describes the input vectors to be used for the training process of a Self-Organizing Map. It is written by the parser or vector generator program that creates the vector structure.
The file consists of two blocks: the first describes the input vectors, following the general file structure of weight vector files, and the second gives the input vectors themselves.
The file structure is identical to that of the SOMLib Weight Vector File. However, the semantics of the first 4 entries change as follows:

Parameter Entries:

The remainder of the file is identical to the SOMLib Weight Vector File:


SOMLib Template Vector File

Standard filename: XXX.tv
Produced by: Parser, Vector Generator
Modified by: -
Demo-File: demo.tv

This file describes the template vectors, which provide the attribute structure of the input vectors used for the training process of a Self-Organizing Map. It is written by the parser or vector generator program that creates the vector structure.

Parameter Entries:

The remainder of this file lists the attributes of the vectors in 7 columns of information, as follows:
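Reading such rows can be sketched without committing to what each column means. The example below only splits every attribute row into its 7 fields and rejects rows with a different count. It assumes, hypothetically, that fields are separated by whitespace and contain no embedded blanks, and that header lines start with `$`; the function name and the sample rows in the usage note are invented for illustration.

```python
def parse_attribute_rows(lines, n_columns=7):
    """Return each attribute row as a list of n_columns string fields.

    Blank lines and assumed `$NAME value` header lines are skipped.
    Raises ValueError naming the offending line if a row has the wrong
    number of fields.
    """
    rows = []
    for number, raw in enumerate(lines, start=1):
        line = raw.strip()
        if not line or line.startswith("$"):
            continue
        fields = line.split()
        if len(fields) != n_columns:
            raise ValueError(
                f"line {number}: expected {n_columns} columns, "
                f"found {len(fields)}"
            )
        rows.append(fields)
    return rows
```

For instance, `parse_attribute_rows(["0 alpha 3 7 1 4 2.33"])` yields one row of seven strings, while a row such as `"0 alpha 3"` raises a `ValueError` that names the line. Fields are left as strings so that no column type is implied.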


SOMLib Unit Description File

Standard filename: XXX.unit
Produced by: SOM training program
Modified by: SOM mapping program, LabelSOM program
Demo-File: demo.unit

This file describes the units of the trained Self-Organizing Map. It is written by the SOM training program.
The file consists of two blocks: the first describes the general SOM structure, the second gives a specific description of every unit.
The first 3 parameter entries serve as a sanity check to find out whether the given SOM map file and weight vector file match. If any of the first 3 parameters does not match, the program should print a detailed error message and exit.

Parameter Entries:

This header describes the general SOM structure.
Following this block, the second block contains the following set of attributes per unit:


SOMLib Vector Description File

Standard filename: XXX.vec
Produced by: Parser or vector generator program
Modified by: SOM browsing software

This file describes the input vectors for a Self-Organizing Map. It is written by the parser or vector generator program and describes the properties of each vector.
The file consists of one set of attributes per vector; the attributes themselves are still subject to modification, or rather, extension. The structure of the vector descriptions generally follows the structure of the unit description file. Further attributes will be added as the need arises, especially in the context of metaphor graphics. Furthermore, it has not yet been fully decided whether each description file should be kept as an independent file or be part of one large file comprising the whole collection.
The attributes considered so far are:

Parameter Entries:

The header above describes the general file structure.
Following this block, the second block contains the following set of attributes per vector/file:


History



Comments: rauber@ifs.tuwien.ac.at