International Conference on Computing and Applied Informatics 2016 | |
Keyword and Event Extraction for Thematic Map Retrieval from Indonesian Online News Site | |
物理学;计算机科学 | |
Dewandaru, A.^1 ; Supriana, I.^1 ; Akbar, S.^1 | |
School of Electrical Engineering and Informatics, Institut Teknologi, Bandung, Indonesia^1 | |
关键词: Event extraction; Extraction process; Geographical information retrievals; Latent dirichlet allocations; Online news sites; Retrieval process; Semantic relatedness; Semantic similarity; | |
Others : https://iopscience.iop.org/article/10.1088/1742-6596/801/1/012068/pdf DOI : 10.1088/1742-6596/801/1/012068 |
|
学科分类:计算机科学(综合) | |
来源: IOP | |
【 摘 要 】
Online news sites provide great deal of information that may be extracted and mined to create Geographical Information Retrieval (GIR) system that provides thematic map based on user query. The task requires extracting generic events and its attributes from the news corpus. This event extraction requires the keywords and topics that can help increase the accuracy of the extraction and retrieval process. We prepare a large online news corpus and compared the Latent Dirichlet Allocation (LDA) and CBOW and skip-gram model to help providing base thematic keywords which can assist the extraction process on the intended GIR system. LDA is better in terms of semantic relatedness and the CBOW and skip-gram is useful for providing semantic similarity.
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
Keyword and Event Extraction for Thematic Map Retrieval from Indonesian Online News Site | 773KB | download |