会议论文详细信息
2nd Nommensen International Conference on Technology and Engineering
K-Means algorithm and modification using gain ratio
Priyatna, Ryan Dhika^1 ; Tulus^1 ; Ramli, Marwan^1
Computer Science, University of Sumatera Utara, Jl. Universitas Kampus USU, Medan
20155, Indonesia^1
关键词: Cluster points;    Data items;    Data-source;    Gain Ratio;    Information gain;    K-means;    k-Means algorithm;   
Others  :  https://iopscience.iop.org/article/10.1088/1757-899X/420/1/012133/pdf
DOI  :  10.1088/1757-899X/420/1/012133
来源: IOP
PDF
【 摘 要 】

K-Means is one method in data mining that can be used to perform grouping clustering of data. Accurate data processing can be done by processing the data source. Each collection or data warehouse can provide important knowledge into valuable information, constraints on this method, if the cluster point is chosen randomly so that the resulting data may vary, if the value is not good, then the resulting grouping is less than optimal. Furthermore, failure to outliers in the process of grouping data include determining whether a data item is an outliers of a cluster of course and whether small amounts of data form a separate cluster. where the gain ratio is used to calculate the attribute's influence on the target of a data gain ratio is the development of the information gain, where the gain ratio eliminates the bias value of each attribute. The result of the research is to calculate the weights in each attribute by using the gain ratio and make the modeling and classification into the method of K-means.

【 预 览 】
附件列表
Files Size Format View
K-Means algorithm and modification using gain ratio 572KB PDF download
  文献评价指标  
  下载次数:7次 浏览次数:46次