next up previous contents
Next: The Harvester Up: Automatic Retrieval of Interactive Previous: Outline of the task   Contents


Modules

When acquiring dynamic web-pages, the focus is on the identification of an interaction and the generation of a suitable query. The possible values queries consist of are not known from the very beginning, they need to be generated. To assemble a query, either previously saved values are used, or possible values for the separate fields are deduced. Also, the resulting page will have to be scrutinised in order to decide whether the request was suitable and successful.

For the whole task, several separate functions can be discerned. When generating a single query they act in a very linear fashion carrying out one action after the other. These modules will be described in the following.



Subsections

Andreas Aschenbrenner