2nd Nommensen International Conference on Technology and Engineering | |
K-Means algorithm and modification using gain ratio | |
Priyatna, Ryan Dhika^1 ; Tulus^1 ; Ramli, Marwan^1 | |
Computer Science, University of Sumatera Utara, Jl. Universitas Kampus USU, Medan | |
20155, Indonesia^1 | |
关键词: Cluster points; Data items; Data-source; Gain Ratio; Information gain; K-means; k-Means algorithm; | |
Others : https://iopscience.iop.org/article/10.1088/1757-899X/420/1/012133/pdf DOI : 10.1088/1757-899X/420/1/012133 |
|
来源: IOP | |
【 摘 要 】
K-Means is one method in data mining that can be used to perform grouping clustering of data. Accurate data processing can be done by processing the data source. Each collection or data warehouse can provide important knowledge into valuable information, constraints on this method, if the cluster point is chosen randomly so that the resulting data may vary, if the value is not good, then the resulting grouping is less than optimal. Furthermore, failure to outliers in the process of grouping data include determining whether a data item is an outliers of a cluster of course and whether small amounts of data form a separate cluster. where the gain ratio is used to calculate the attribute's influence on the target of a data gain ratio is the development of the information gain, where the gain ratio eliminates the bias value of each attribute. The result of the research is to calculate the weights in each attribute by using the gain ratio and make the modeling and classification into the method of K-means.
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
K-Means algorithm and modification using gain ratio | 572KB | download |