期刊论文详细信息
NEUROCOMPUTING 卷:365
Establishing strong imputation performance of a denoising autoencoder in a wide range of missing data problems
Article
Abiri, Najmeh1  Linse, Bjorn1  Eden, Patrik1  Ohlsson, Mattias1,2 
[1] Lund Univ, Dept Astron & Theoret Phys, Lund, Sweden
[2] Halmstad Univ, Ctr Appl Intelligent Syst Res, Halmstad, Sweden
关键词: Deep learning;    Autoencoder;    Imputation;    Missing data;   
DOI  :  10.1016/j.neucom.2019.07.065
来源: Elsevier
PDF
【 摘 要 】

Dealing with missing data in data analysis is inevitable. Although powerful imputation methods that address this problem exist, there is still much room for improvement. In this study, we examined single imputation based on deep autoencoders, motivated by the apparent success of deep learning to efficiently extract useful dataset features. We have developed a consistent framework for both training and imputation. Moreover, we benchmarked the results against state-of-the-art imputation methods on different data sizes and characteristics. The work was not limited to the one-type variable dataset; we also imputed missing data with multi-type variables, e.g., a combination of binary, categorical, and continuous attributes. To evaluate the imputation methods, we randomly corrupted the complete data, with varying degrees of corruption, and then compared the imputed and original values. In all experiments, the developed autoencoder obtained the smallest error for all ranges of initial data corruption. (C) 2019 Elsevier B.V. All rights reserved.

【 授权许可】

Free   

【 预 览 】
附件列表
Files Size Format View
10_1016_j_neucom_2019_07_065.pdf 1857KB PDF download
  文献评价指标  
  下载次数:0次 浏览次数:0次