期刊论文详细信息
Data Science Journal
An Improved Correlation-Based Algorithm with Discretization for Attribute Reduction in Data Clustering
S. Senthamarai Kannan2  N. Ramaraj1 
[1] Principal, G. K. M. Engineering College;Department of Information Technology, Thiagarajar College of Engineering
关键词: Clustering;    Attribute reduction;    Data discretization;    Correlation-based model;    Knowledge discovery;    Data mining;   
DOI  :  10.2481/dsj.007-044
学科分类:计算机科学(综合)
来源: Ubiquity Press Ltd.
PDF
【 摘 要 】

References(65)Attribute reduction aims to reduce the dimensionality of large scale data without losing useful information and is an important topic of knowledge discovery, data clustering, and classification. In this paper, we aim to solve the current problem that a continuous attribute in a clustering or classification algorithm must be made discrete. We propose a new algorithm of data reduction based on a correlation model with data discretization. It deals with selection of continuous attributes from a very large set of attributes. The proposed algorithm is an extended version of the Fast Correlation-based filter algorithm and is named FCBF+. The FCBF+ algorithm performs the discretization of continuous attributes in an efficient manner. Then it selects the relevant attributes from a very large set of attributes.Performance evaluation is done on clustering accuracy for all the features, and a reduced set of features is obtained using FCBF+. It is found that the proposed FCBF+ algorithm improves the clustering accuracy of various clustering algorithms.

【 授权许可】

Unknown   

【 预 览 】
附件列表
Files Size Format View
RO201911300045346ZK.pdf 323KB PDF download
  文献评价指标  
  下载次数:17次 浏览次数:55次