Journal of Big Data | |
Big data fuzzy C-means algorithm based on bee colony optimization using an Apache Hbase | |
Samad Paydar1  Mohsen Kahani1  Seyyed Mohammad Razavi1  | |
[1] Department of Computer Engineering, Ferdowsi University of Mashhad, Mashhad, Iran; | |
关键词: Clustering; Big data; Bee colony; MapReduce; | |
DOI : 10.1186/s40537-021-00450-w | |
来源: Springer | |
【 摘 要 】
Clustering algorithm analysis, including time and space complexity analysis, has always been discussed in the literature. The emergence of big data has also created a lot of challenges for this issue. Because of high complexity and execution time, traditional clustering techniques cannot be used for such an amount of data. This problem has been addressed in this research. To present the clustering algorithm using a bee colony algorithm and high-speed read/write performance, Map-Reduce architecture is used. Using this architecture allows the proposed method to cluster any volume of data, and there is no limit to the amount of data. The presented algorithm has good performance and high precision. The simulation results on 3 datasets show that the presented algorithm is more efficient than other big data clustering methods. Also, the results of our algorithm execution time on huge datasets are much better than other big data clustering approaches.
【 授权许可】
CC BY
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
RO202107068502191ZK.pdf | 1730KB | download |