All CLEF-IP collections are extracts of MAREC. They are released under a Creative Commons license (see the paragraph below) and are now freely available for download.
MAREC by IRF is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License. Permissions beyond the scope of this license may be available at mailto:firstname.lastname@example.org.
The documents in the patent collection are stored as XML files.
The documents are derived from the European Patent Office and have mixed content in English, German and French.
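Because the documents are plain XML, a quick way to get oriented is to count which element tags a file contains. The snippet below is a minimal sketch using Python's standard library. The function name `count_tags` and the miniature `SAMPLE` document are assumptions made for illustration only; its tag and attribute names are placeholders and do not reflect the actual collection schema.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def count_tags(xml_text):
    """Return a Counter mapping each element tag to its number of occurrences."""
    root = ET.fromstring(xml_text)
    # root.iter() walks the root element and all of its descendants.
    return Counter(element.tag for element in root.iter())


# Hypothetical miniature document: the tag and attribute names are
# placeholders for illustration, not the real collection schema.
SAMPLE = """<doc>
  <title lang="EN">Example title</title>
  <title lang="DE">Beispieltitel</title>
  <title lang="FR">Titre d'exemple</title>
  <text lang="EN">Some descriptive text.</text>
</doc>"""

tag_counts = count_tags(SAMPLE)
for tag, count in tag_counts.most_common():
    print(tag, count)
```

To survey a file on disk rather than a string, `ET.parse(path).getroot()` can be used in place of `ET.fromstring(xml_text)`.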
The files contain bibliographic data as well as descriptive text. The XML files are quite comprehensive, containing detailed information on inventors, assignees, priority dates, etc. From the variety of information in the XML files, these are the elements to look at first: