期刊论文详细信息
CAAI Transactions on Intelligence Technology
Imputing missing values using cumulative linear regression
article
Samih M. Mostafa1 
[1] Faculty of Science, Mathematics Department, Computer Science, South Valley University
关键词: regression analysis;    mean square error methods;    estimation theory;    data handling;    cumulative linear regression;    statistical methods;    imputation algorithm;    linear regression technique;    imputed variables;    linear regression equation;    imputation time;    missing data handling;    missing values;    root-mean-square error;    mean absolute error;    coefficient of determination;    C1140Z Other topics in statistics;    C4130 Interpolation and function approximation (numerical analysis);    C6130 Data handling techniques;   
DOI  :  10.1049/trit.2019.0032
学科分类:数学(综合)
来源: Wiley
PDF
【 摘 要 】

The concept of missing data is important to apply statistical methods on the dataset. Statisticians and researchers may end up to an inaccurate illation about the data if the missing data are not handled properly. Of late, Python and R provide diverse packages for handling missing data. In this study, an imputation algorithm, cumulative linear regression, is proposed. The proposed algorithm depends on the linear regression technique. It differs from the existing methods, in that it cumulates the imputed variables; those variables will be incorporated in the linear regression equation to filling in the missing values in the next incomplete variable. The author performed a comparative study of the proposed method and those packages. The performance was measured in terms of imputation time, root-mean-square error, mean absolute error, and coefficient of determination . On analysing on five datasets with different missing values generated from different mechanisms, it was observed that the performances vary depending on the size, missing percentage, and the missingness mechanism. The results showed that the performance of the proposed method is slightly better.

【 授权许可】

CC BY|CC BY-ND|CC BY-NC|CC BY-NC-ND   

【 预 览 】
附件列表
Files Size Format View
RO202107100000053ZK.pdf 263KB PDF download
  文献评价指标  
  下载次数:16次 浏览次数:1次