期刊论文详细信息
International Journal of Information Management Data Insights
A query expansion method based on topic modeling and DBpedia features
Abderrahim El Qadi1  Sarah Dahir2 
[1] ENSAM, Mohammed V University in Rabat, Morocco;IMAGE Laboratory, SCIAM Team, High School of Technology, Moulay Ismail University of Meknes, Morocco;
关键词: Information retrieval;    Query expansion;    DBpedia;    Term distribution;    Topic modeling;    Language model;   
DOI  :  
来源: DOAJ
【 摘 要 】

Query Expansion (QE) is a method used for improving Information Retrieval (IR) by adding the terms that are almost selected from feedback documents, and similar to the user query terms. But, due to the very small average number of query keywords; it is sometimes difficult to detect the context around the user query, and expand the query accordingly, especially when it contains ambiguous terms(i.e. polysemy terms). To this end, Linked Open Data (LOD) sources may be exploited. Yet, most attributes from linked data are multi-valued which makes a system unable to determine the right one(s) to use for expansion. And few other attributes are single-valued but too long and noisy to use directly. To deal with the previous issues, integration of the topic modeling process has been proposed to predict the latent semantic attribute-topics to use for expansion. This approach reconstructs candidate documents for a given query using distribution technique Bose-Einstein statistics (Bo1) and DBpedia attributes. The Latent Dirichlet Allocation(LDA) based topic models are then generated by considering these documents and the relevant expansion terms are then determined. The proposed method has been evaluated using the AP dataset collection, and the experiments revealed significant improvements according to the retrieval results using the distribution technique Bo1. Also, the proposed “LDA-LinkedBo1” approach outperformed DBpedia association based approaches in terms of MRR@N.

【 授权许可】

Unknown   

  文献评价指标  
  下载次数:0次 浏览次数:5次