Browse ICM UW by author "Bolikowski, Łukasz"
Aspekty ekstrakcji autorów na podstawie metadanych dokumentów (Aspects of author extraction based on document metadata)
Dendek, Piotr; Bolikowski, Łukasz (2011-08-10) The presentation addresses the problem of missing author-work links (contributions). It discusses a flexibly extensible tool for recovering these links using metadata. ...
Author disambiguation in the YADDA2 software platform
Dendek, Piotr Jan; Wojewódzki, Mariusz; Bolikowski, Łukasz (Springer, 2013) SYNAT platform powered by the YADDA2 architecture has been extended with the Author Disambiguation Framework and the Query Framework. The former framework clusters occurrences of contributor names into identities of authors, ...
Comparing hierarchical mathematical document clustering against the Mathematics Subject Classification tree
Kuśmierczyk, Tomasz; Łukasik, Michał; Bolikowski, Łukasz; Nguyen, Hung Son (Springer, 2013) Mathematical publications are often labelled with Mathematical Subject Classification codes. These codes are grouped in a treelike hierarchy created by experts. In this paper we posit that this hierarchy is highly correlated ...
Data model for analysis of scholarly documents in the MapReduce paradigm
Kawa, Adam; Bolikowski, Łukasz; Czeczko, Artur; Dendek, Piotr Jan; Tkaczyk, Dominika (Springer, 2013) At CEON ICM UW we are in possession of a large collection of scholarly documents that we store and process using MapReduce paradigm. One of the main challenges is to design a simple, but effective data model that fits ...
Evaluation of Features for Author Name Disambiguation Using Linear Support Vector Machines
Dendek, Piotr Jan; Bolikowski, Łukasz; Łukasik, Michał (IEEE Computer Society Conference Publishing Services, 2012-03-27) Author name disambiguation allows to distinguish between two or more authors sharing the same name. In a previous paper, we have proposed a name disambiguation framework in which for each author name in each article we ...
GROTOAP: GROund Truth for Open Access Publications
Tkaczyk, Dominika; Czeczko, Artur; Rusek, Krzysztof; Bolikowski, Łukasz; Bogacewicz, Roman (ACM, 2012-06) The field of digital document content analysis includes many important tasks, for example page segmentation or zone classification. It is impossible to build effective solutions for such problems and evaluate their performance ...
Hierarchical, multi-label classification of scholarly publications: modifications of ML-KNN algorithm
Łukasik, Michał; Kuśmierczyk, Tomasz; Bolikowski, Łukasz; Nguyen, Hung Son (Springer, 2013) One of the common problems when dealing with digital libraries is lack of classification codes in some of the documents. In the following publication we deal with this problem in a multi-label, hierarchical case of Mathematics ...
Methodology for evaluating citation parsing and matching
Fedoryszak, Mateusz; Bolikowski, Łukasz; Tkaczyk, Dominika; Wojciechowski, Krzysztof (Springer, 2013) Bibliographic references between scholarly publications contain valuable information for researchers and developers involved with digital repositories. They are indicators of topical similarity between linked texts, impact ...
A modular metadata extraction system for born-digital articles
Tkaczyk, Dominika; Bolikowski, Łukasz; Czeczko, Artur; Rusek, Krzysztof (2012-03-27) We present a comprehensive system for extracting metadata from scholarly articles. In our approach the entire document is inspected, including headers and footers of all the pages as well as bibliographic references. ...
The Neumann problem in an irregular domain
Bolikowski, Łukasz; Gokieli, Maria; Varchon, Nicolas (EMS, Interfaces and Free Boundaries, 2010) We consider the stability of patterns for the reaction-diffusion equation with Neumann boundary conditions in an irregular domain in ℝ^N, N ≥ 2, the model example being two convex regions connected by a small ‘hole’ in their ...
Simulating Phase Transition Dynamics on Non-trivial Domains
Bolikowski, Łukasz; Gokieli, Maria (Lecture Notes in Computer Science, 2013) Our goal is to investigate the influence of the geometry and topology of the domain Ω on the solutions of the phase transition and other diffusion-driven phenomena in Ω, modeled e.g. by the Allen-Cahn, Cahn-Hilliard, ...
Tagging Scientific Publications using Wikipedia and Natural Language Processing Tools. Comparison on the ArXiv Dataset
Łopuszyński, Michał; Bolikowski, Łukasz (Springer, 2014) In this work, we compare two simple methods of tagging scientific publications with labels reflecting their content. As a first source of labels Wikipedia is employed, second label set is constructed from the noun phrases ...
Towards a flexible author name disambiguation framework
Bolikowski, Łukasz; Dendek, Piotr Jan (Masaryk University Press, 2011-06-27) In this paper we propose a flexible, modular framework for author name disambiguation. Our solution consists of the core which orchestrates the disambiguation process, and replaceable modules performing concrete tasks. The ...
Towards robust tags for scientific publications from natural language processing tools and Wikipedia
Łopuszyński, Michał; Bolikowski, Łukasz (Springer, 2015) In this work, two simple methods of tagging scientific publications with labels reflecting their content are presented and compared. As a first source of labels, Wikipedia is employed. A second label set is constructed from ...
Workflow of metadata extraction from retro-born-digital documents
Tkaczyk, Dominika; Bolikowski, Łukasz (2011-06-27) In this work-in-progress report we propose a workflow for metadata extraction from articles in a digital form. We decompose the problem into clearly defined sub-tasks and outline possible implementations of the sub-tasks. ...
Workflow of metadata extraction from retro-born-digital documents
Tkaczyk, Dominika; Bolikowski, Łukasz (2011-07-13)
YADDA2 – Assemble Your Own Digital Library Application from Lego Bricks
Sylwestrzak, Wojtek; Rosiek, Tomasz; Bolikowski, Łukasz (ACM, 2012-06) YADDA2 is an open software platform which facilitates creation of digital library applications. It consists of versatile building blocks providing, among others: storage, relational and full-text indexing, process management, ...