会议论文详细信息
High Performance Computing Symposium 2013 | |
Implementation of a solution Cloud Computing with MapReduce model | |
计算机科学;物理学 | |
Baya, Chalabi^1 | |
Higher National School of Computer Science, Algeria^1 | |
关键词: Cloud infrastructures; Data analysis algorithms; High availability; K; means clustering; Large-scale computer systems; MapReduce models; Programming models; Very large datum; | |
Others : https://iopscience.iop.org/article/10.1088/1742-6596/540/1/012004/pdf DOI : 10.1088/1742-6596/540/1/012004 |
|
学科分类:计算机科学(综合) | |
来源: IOP | |
【 摘 要 】
In recent years, large scale computer systems have emerged to meet the demands of high storage, supercomputing, and applications using very large data sets. The emergence of Cloud Computing offers the potentiel for analysis and processing of large data sets. Mapreduce is the most popular programming model which is used to support the development of such applications. It was initially designed by Google for building large datacenters on a large scale, to provide Web search services with rapid response and high availability. In this paper we will test the clustering algorithm K-means Clustering in a Cloud Computing. This algorithm is implemented on MapReduce. It has been chosen for its characteristics that are representative of many iterative data analysis algorithms. Then, we modify the framework CloudSim to simulate the MapReduce execution of K-means Clustering on different Cloud Computing, depending on their size and characteristics of target platforms. The experiment show that the implementation of K-means Clustering gives good results especially for large data set and the Cloud infrastructure has an influence on these results.【 预 览 】
Files | Size | Format | View |
---|---|---|---|
Implementation of a solution Cloud Computing with MapReduce model | 795KB | download |