2018 4th International Conference on Environmental Science and Material Application | |
Innovation of Cluster Method for Mixed Data Based on Specific Initialization Process and Attribute Weighting | |
生态环境科学;材料科学 | |
Ma, Xiaoqing^1 | |
Australian National University ANU Canberra, Australia^1 | |
关键词: Attribute weighting; Categorical attributes; Cluster method; Clustering evaluation; Density distributions; Exponential effects; K-prototype; Weight values; | |
Others : https://iopscience.iop.org/article/10.1088/1755-1315/252/5/052100/pdf DOI : 10.1088/1755-1315/252/5/052100 |
|
来源: IOP | |
【 摘 要 】
This paper proposes one improved K-Prototype algorithm based on innovations of controlling initialization process and attribute weighting (KP-IW) in order to deal with mixed data containing numeric and categorical attributes. Making initialization of clustering fixed and giving weightings to attributes are two common principles for improving algorithms. However, there are rarely methods regarding numeric or categorical proportion as one new attribute, which will affect the initialization consequence and weight value assigning to attribute because that density distribution of instances is calculated by the combing each attributes and those entire two proportions instead of only the former. There are some more detailed innovations for initialization and weighting, involving auxiliary point, auxiliary clusters and weightings combing linear and exponential effect. And it can be concluded that the KP-IW algorithm is suitable according to the clustering evaluation scores from KP-IW compared with others algorithms.
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
Innovation of Cluster Method for Mixed Data Based on Specific Initialization Process and Attribute Weighting | 372KB | download |