期刊论文详细信息
International Journal of Physical Sciences
Text clustering on latent semantic indexing with particle swarm optimization (PSO) algorithm
Eisa Hasanzadeh1 
关键词: Vector space model;    particle swarm optimization (PSO) algorithm;    latent semantic indexing;    text clustering;    adaptive inertia weight.;   
DOI  :  10.5897/IJPS11.692
学科分类:物理(综合)
来源: Academic Journals
PDF
【 摘 要 】

Most of web users use various search engines to get specific information. A key factor in the success of web search engines are their ability to rapidly find good quality results to the queries that are based on specific terms. This paper aims at retrieving more relevant documents from a huge corpus based on the required information. We propose aparticle swarm optimization algorithm based on latent semantic indexing (PSO+LSI) for text clustering. PSO family of bio-inspired algorithms has recently successfully been applied to a number of real word clustering problems. We use an adaptive inertia weight (AIW) that do proper exploration and exploitation in search space. PSO can merge with LSI to achieve best clustering accuracy and efficiency.This framework provides more relevant documents to the user and reduces the irrelevant documents. It would be seen thatfor all numbers of dimensions, PSO+LSI are faster than PSO+Kmeans algorithms using vector space model (VSM). It takes 22.3s for PSO+LSI method with 1000 terms to obtain its best performance on 150 dimensions.

【 授权许可】

CC BY   

【 预 览 】
附件列表
Files Size Format View
RO201902017145985ZK.pdf 303KB PDF download
  文献评价指标  
  下载次数:12次 浏览次数:8次