会议论文

【摘要】

K-means method is limited in identifying and grouping the data by characteristics similarity in clustering. This study develops K-Means method with LSA to fix the issue of objectivity in data clustering as compared to K-Means method used lately. Data variables used are study load (credits) and study period (semester) of students in two academic years, which is amounted to 1,089 records. Data is analysed by using comparative statistic between the results of clustering test using K-Means method and K-Means method with LSA. The test findings show that the clustering using K-Means only groups the data into 3 clusters while the use of K-Means method with LSA produces 5 clusters. There are 327 different characteristics data identified by K-Means method with LSA which are grouped in two new clauses so it results in five clusters, for what is rated similar by K-Means method which only produces three clusters. This study concludes that K-Means method with LSA is more objective in clustering the data clustering and reducing MSE level error due to the sensitivity of data similarity within the cluster as always happened with K-Means method. Therefore, it is recommended that K-Means method with LSA be used in clustering to objectively identify the data and avoid any errors in the clustering process for more optimal data utilization.

【预览】

附件列表
Files	Size	Format	View
K-means method with linear search algorithm to reduce Means Square Error (MSE) within data clustering	752KB	PDF	download

3rd Annual Applied Science and Engineering Conference
K-means method with linear search algorithm to reduce Means Square Error (MSE) within data clustering
工业技术;自然科学
Sriadhi, S.^1 ; Gultom, S.^2 ; Martiano, M.^1 ; Rahim, R.^3 ; Abdullah, D.^4
Department of ITC Education, Universitas Negeri Medan, Medan, Indonesia^1
Department of Mathematics Education, Universitas Negeri Medan, Medan, Indonesia^2
Department of Informatics Technology, Institut Tekbologi Medan, Medan, Indonesia^3
Department of Informatics, Universitas Malikussaleh, Medan, Indonesia^4
关键词: Clustering process; Clustering test; Data clustering; Data similarity; Data variables; K-means method; Linear search algorithms; Means square errors;
Others : https://iopscience.iop.org/article/10.1088/1757-899X/434/1/012032/pdf DOI : 10.1088/1757-899X/434/1/012032

来源: IOP
PDF


	文献评价指标
	下载次数：16次	浏览次数：21次

【 摘 要 】

【 预 览 】

【摘要】

【预览】