Applied Sciences | |
SPUCL (Scientific Publication Classifier): A Human-Readable Labelling System for Scientific Publications | |
Alessandra Pieroni1  Michela Montorsi2  Noemi Scarpato2  | |
[1] Agency for Digital Italy (AgID), Smart Cities Service Via Liszt 21, 00144 Rome, Italy;Departement of Human Sciences and Promotion of the Quality of Life, San Raffaele Roma Open University, 00166 Rome, Italy; | |
关键词: document classification; human-readable labelling; Word2Vec; | |
DOI : 10.3390/app11199154 | |
来源: DOAJ |
【 摘 要 】
To assess critically the scientific literature is a very challenging task; in general it requires analysing a lot of documents to define the state-of-the-art of a research field and classifying them. The documents classifier systems have tried to address this problem by different techniques such as probabilistic, machine learning and neural networks models. One of the most popular document classification approaches is the LDA (Latent Dirichlet Allocation), a probabilistic topic model. One of the main issues of the LDA approach is that the retrieved topics are a collection of terms with their probabilities and it does not have a human-readable form. This paper defines an approach to make LDA topics comprehensible for humans by the exploitation of the Word2Vec approach.
【 授权许可】
Unknown