期刊论文详细信息
Journal of King Saud University: Computer and Information Sciences 卷:34
Optimization driven cluster based indexing and matching for the document retrieval
Sanjay Kumar Jain1  Mamta Kayest2 
[1] Computer Engineering Department, National Institute of Technology, Kurukshetra, Haryana 136119, India;
[2] Corresponding author.;
关键词: Firefly algorithm;    Monarch butterfly optimization;    Stemming;    Holoentropy;    Cluster-based indexing;   
DOI  :  
来源: DOAJ
【 摘 要 】

Document retrieval methods concentrate on minimizing the time taken for the navigator to recall the entire document while analyzing the concepts, themes, and contents of the document based on their research goals. The exploitation of the repetitiveness in order to reduce the usage space is a hectic challenge. This paper proposes a document retrieval mechanism using an optimization, Monarch Butterfly optimization-based FireFly (MB-FF), developed with the integration of the Monarch Butterfly Optimization (MBO) and Firefly Algorithm (FA). The keywords from the documents are identified from the pre-processed document, which is pre-processed using stemming and stop word removal. The Term Frequency-Inverse Document Frequency (TF-IDF) is used in the extraction of the keywords and the concept of holoentropy is used in the selection of the significant keywords. The selected keywords assures the retrieval of the relevant documents, which initially is processed through cluster-based indexing using the Monarch Butterfly optimization-based firefly (MB-FF) that is followed with the two-level mod-Bhattacharya distance match. The performance of the MB-FF algorithm in document retrieval mechanism is evaluated using Precision, recall, and F-measure.

【 授权许可】

Unknown   

  文献评价指标  
  下载次数:0次 浏览次数:0次