期刊论文详细信息
Journal of Computer Science
A framework to Deal with Missing Data in Data Sets | Science Publications
Mohannad Najjar1  Luai A. Shalabi1  Ahmad A. Kayed1 
关键词: Data mining;    missing data;    rules;    reducts;    coverage;   
DOI  :  10.3844/jcssp.2006.740.745
学科分类:计算机科学(综合)
来源: Science Publications
PDF
【 摘 要 】

Most information systems usually have some missing values due to unavailable data. Missing values minimizing the quality of classification rules generated by a data mining system. Missing vales also affecting the quantity of classification rules achieved by the data mining system. Missing values could influence the coverage percentage and number of reducts generated. Missing values lead to the difficulty of extracting useful information from that data set. Solving the problem of missing data is of a high priority in the field of data mining and knowledge discovery. Replacing missing values by a specific value should not affect the quality of the data. Four different models for dealing with missing data were studied. A framework is established that remove inconsistencies before and after filling the attributes of missing values with the new expected value as generated by one of the four models. Comparative results were discussed and recommendations were concluded.

【 授权许可】

Unknown   

【 预 览 】
附件列表
Files Size Format View
RO201911300666231ZK.pdf 102KB PDF download
  文献评价指标  
  下载次数:5次 浏览次数:1次