Szukaj
Wyświetlanie pozycji 1-2 z 2
GROTOAP: GROund Truth for Open Access Publications
(ACM, 2012-06)
The field of digital document content analysis includes many important tasks, for example page segmentation or zone classification. It is impossible to build effective solutions for such problems and evaluate their performance ...
Towards robust tags for scientific publications from natural language processing tools and Wikipedia
(Springer, 2015)
In this work, two simple methods of tagging
scientific publications with labels reflecting their content
are presented and compared. As a first source of labels,
Wikipedia is employed. A second label set is constructed
from ...