会议论文

【摘要】

K-Means is one method in data mining that can be used to perform grouping clustering of data. Accurate data processing can be done by processing the data source. Each collection or data warehouse can provide important knowledge into valuable information, constraints on this method, if the cluster point is chosen randomly so that the resulting data may vary, if the value is not good, then the resulting grouping is less than optimal. Furthermore, failure to outliers in the process of grouping data include determining whether a data item is an outliers of a cluster of course and whether small amounts of data form a separate cluster. where the gain ratio is used to calculate the attribute's influence on the target of a data gain ratio is the development of the information gain, where the gain ratio eliminates the bias value of each attribute. The result of the research is to calculate the weights in each attribute by using the gain ratio and make the modeling and classification into the method of K-means.

【预览】

附件列表
Files	Size	Format	View
K-Means algorithm and modification using gain ratio	572KB	PDF	download

2nd Nommensen International Conference on Technology and Engineering
K-Means algorithm and modification using gain ratio

Priyatna, Ryan Dhika^1 ; Tulus^1 ; Ramli, Marwan^1
Computer Science, University of Sumatera Utara, Jl. Universitas Kampus USU, Medan
20155, Indonesia^1
关键词: Cluster points; Data items; Data-source; Gain Ratio; Information gain; K-means; k-Means algorithm;
Others : https://iopscience.iop.org/article/10.1088/1757-899X/420/1/012133/pdf DOI : 10.1088/1757-899X/420/1/012133

来源: IOP
PDF


	文献评价指标
	下载次数：20次	浏览次数：47次

【 摘 要 】

【 预 览 】

【摘要】

【预览】