First International Workshop on Knowledge Discovery and Data Mining Meets Linked Open Data
Mining Wikipedia's Snippets Graph: First Step to Build A New Knowledge Base
Andias Wira-Alam ; Brigitte Mathiak
Others : http://ceur-ws.org/Vol-868/paper6.pdf PID : 43543
Source: CEUR
【 Abstract 】
In this paper, we discuss the aspects of mining links and text snippets from Wikipedia as a new knowledge base. Current knowledge bases, e.g. DBpedia [1], mainly cover the structured part of Wikipedia, but not the content as a whole. As a complement, we focus on extracting information from the text of the articles. We extract a database of the hyperlinks between Wikipedia articles and populate it with the textual context surrounding each hyperlink. This would be useful for network analysis, e.g. to measure the influence of one topic on another, or for direct question answering (for stating the relationship between two entities). First, we describe the technical parts related to extracting the data from Wikipedia. Second, we specify how to represent the extracted data as an extended triple through a Web service. Finally, we discuss the expected usage possibilities and the challenges.
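To make the idea of hyperlinks annotated with their textual context concrete, the following is a minimal sketch and not the authors' pipeline. It assumes the input is raw wikitext with `[[Target]]` or `[[Target|label]]` links, takes the enclosing sentence as the snippet, and uses a hypothetical `SnippetTriple` record of (source article, target article, snippet) as one possible reading of an "extended triple". The names `SnippetTriple` and `extract_triples`, and the naive sentence splitter, are illustrative assumptions.

```python
import re
from typing import List, NamedTuple


class SnippetTriple(NamedTuple):
    """Hypothetical record: a link between two articles plus its context."""
    source: str   # article that contains the link
    target: str   # article the link points to
    snippet: str  # sentence surrounding the link, with link markup removed


# Matches [[Target]], [[Target#Section]] and [[Target|label]] in wikitext.
WIKILINK = re.compile(r"\[\[([^\[\]|#]+)(?:#[^\[\]|]*)?(?:\|([^\[\]]*))?\]\]")

# Naive sentence boundary: whitespace that follows '.', '!' or '?'.
SENTENCE_END = re.compile(r"(?<=[.!?])\s+")


def _visible_text(match: "re.Match[str]") -> str:
    """Return the text a reader sees for a link: the label, else the target."""
    return match.group(2) or match.group(1)


def extract_triples(source: str, wikitext: str) -> List[SnippetTriple]:
    """Collect one triple per link, using the enclosing sentence as snippet."""
    triples = []
    for sentence in SENTENCE_END.split(wikitext):
        # Plain-text version of the sentence, shared by all links inside it.
        snippet = WIKILINK.sub(_visible_text, sentence).strip()
        for match in WIKILINK.finditer(sentence):
            target = match.group(1).strip()
            triples.append(SnippetTriple(source, target, snippet))
    return triples


text = (
    "Berlin is the capital of [[Germany]]. "
    "It lies on the [[Spree|river Spree]]."
)
triples = extract_triples("Berlin", text)
for triple in triples:
    print(triple)
```

Using the sentence as the context window is only one choice; a fixed character window or the whole paragraph would fit the same record layout. Real Wikipedia markup also contains templates, references, and file links that this sketch does not handle.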
【 Preview 】
Files | Size | Format | View |
---|---|---|---|
Mining Wikipedia's Snippets Graph: First Step to Build A New Knowledge Base | 532KB | download |