Szukaj
Wyświetlanie pozycji 1-2 z 2
Data model for analysis of scholarly documents in the MapReduce paradigm
(Springer, 2013)
At CEON ICM UW we are in possession of a large collection of scholarly documents that we store and process using MapReduce paradigm. One of the main challenges is to design a simple, but effective data model that fits ...
A modular metadata extraction system for born-digital articles
(2012-03-27)
We present a comprehensive system for extracting
metadata from scholarly articles. In our approach the entire
document is inspected, including headers and footers of all the
pages as well as bibliographic references. ...