Journal Article Details
Journal of Big Data
Big data fuzzy C-means algorithm based on bee colony optimization using an Apache Hbase
Samad Paydar [1], Mohsen Kahani [1], Seyyed Mohammad Razavi [1]
[1] Department of Computer Engineering, Ferdowsi University of Mashhad, Mashhad, Iran
Keywords: Clustering; Big data; Bee colony; MapReduce
DOI: 10.1186/s40537-021-00450-w
Source: Springer
【 Abstract 】

The analysis of clustering algorithms, including their time and space complexity, has long been discussed in the literature. The emergence of big data has made this problem more challenging: because of their high complexity and long execution time, traditional clustering techniques cannot be applied to data at this scale. This research addresses that problem. The proposed clustering algorithm is based on a bee colony algorithm and uses the MapReduce architecture to obtain high-speed read/write performance. Because of this architecture, the proposed method places no limit on the volume of data it can cluster. The presented algorithm achieves good performance and high precision. Simulation results on three datasets show that it is more efficient than other big data clustering methods, and its execution time on very large datasets is much better than that of other big data clustering approaches.

【 License 】

CC BY   
