Now showing items 1-2 of 2

    • GROTOAP: GROund Truth for Open Access Publications 

      Tkaczyk, Dominika; Czeczko, Artur; Rusek, Krzysztof; Bolikowski, Łukasz; Bogacewicz, Roman (ACM, 2012-06)
      The field of digital document content analysis includes many important tasks, for example page segmentation or zone classification. It is impossible to build effective solutions for such problems and evaluate their performance ...
    • A modular metadata extraction system for born-digital articles 

      Tkaczyk, Dominika; Bolikowski, Łukasz; Czeczko, Artur; Rusek, Krzysztof (2012-03-27)
      We present a comprehensive system for extracting metadata from scholarly articles. In our approach the entire document is inspected, including headers and footers of all the pages as well as bibliographic references. ...