Data model for analysis of scholarly documents in the MapReduce paradigm
At CEON ICM UW, we hold a large collection of scholarly documents that we store and process using the MapReduce paradigm. One of the main challenges is to design a simple but effective data model that fits ...
Methodology for evaluating citation parsing and matching
Bibliographic references between scholarly publications contain valuable information for researchers and developers involved with digital repositories. They are indicators of topical similarity between linked texts, impact ...
A modular metadata extraction system for born-digital articles
We present a comprehensive system for extracting metadata from scholarly articles. In our approach, the entire document is inspected, including the headers and footers of all pages as well as the bibliographic references. ...