| International Journal of Physical Sciences | |
| Text clustering on latent semantic indexing with particle swarm optimization (PSO) algorithm | |
| Eisa Hasanzadeh1  | |
| 关键词: Vector space model; particle swarm optimization (PSO) algorithm; latent semantic indexing; text clustering; adaptive inertia weight.; | |
| DOI : 10.5897/IJPS11.692 | |
| 学科分类:物理(综合) | |
| 来源: Academic Journals | |
PDF
|
|
【 摘 要 】
Most of web users use various search engines to get specific information. A key factor in the success of web search engines are their ability to rapidly find good quality results to the queries that are based on specific terms. This paper aims at retrieving more relevant documents from a huge corpus based on the required information. We propose aparticle swarm optimization algorithm based on latent semantic indexing (PSO+LSI) for text clustering. PSO family of bio-inspired algorithms has recently successfully been applied to a number of real word clustering problems. We use an adaptive inertia weight (AIW) that do proper exploration and exploitation in search space. PSO can merge with LSI to achieve best clustering accuracy and efficiency.This framework provides more relevant documents to the user and reduces the irrelevant documents. It would be seen thatfor all numbers of dimensions, PSO+LSI are faster than PSO+Kmeans algorithms using vector space model (VSM). It takes 22.3s for PSO+LSI method with 1000 terms to obtain its best performance on 150 dimensions.
【 授权许可】
CC BY
【 预 览 】
| Files | Size | Format | View |
|---|---|---|---|
| RO201902017145985ZK.pdf | 303KB |
PDF