Application of Topic Models to Judgments from Public Procurement Domain
In this work, automatic analysis of themes contained in a large corpus of judgments from the public procurement domain is performed. The employed technique is unsupervised latent Dirichlet allocation (LDA). In addition, it ...
Towards robust tags for scientific publications from natural language processing tools and Wikipedia
In this work, two simple methods of tagging scientific publications with labels reflecting their content are presented and compared. As a first source of labels, Wikipedia is employed. A second label set is constructed from ...
Unsupervised Keyword Extraction From Polish Legal Texts
In this work, we present an application of the recently proposed unsupervised keyword extraction algorithm RAKE to a corpus of Polish legal texts from the field of public procurement. RAKE is essentially a language ...
Tagging Scientific Publications using Wikipedia and Natural Language Processing Tools. Comparison on the ArXiv Dataset
In this work, we compare two simple methods of tagging scientific publications with labels reflecting their content. As a first source of labels, Wikipedia is employed; a second label set is constructed from the noun phrases ...