期刊论文

【摘要】

Dealing with missing data in data analysis is inevitable. Although powerful imputation methods that address this problem exist, there is still much room for improvement. In this study, we examined single imputation based on deep autoencoders, motivated by the apparent success of deep learning to efficiently extract useful dataset features. We have developed a consistent framework for both training and imputation. Moreover, we benchmarked the results against state-of-the-art imputation methods on different data sizes and characteristics. The work was not limited to the one-type variable dataset; we also imputed missing data with multi-type variables, e.g., a combination of binary, categorical, and continuous attributes. To evaluate the imputation methods, we randomly corrupted the complete data, with varying degrees of corruption, and then compared the imputed and original values. In all experiments, the developed autoencoder obtained the smallest error for all ranges of initial data corruption. (C) 2019 Elsevier B.V. All rights reserved.

【授权许可】

Free

【预览】

附件列表
Files	Size	Format	View
10_1016_j_neucom_2019_07_065.pdf	1857KB	PDF	download

NEUROCOMPUTING	卷:365
Establishing strong imputation performance of a denoising autoencoder in a wide range of missing data problems
Article
Abiri, Najmeh¹ Linse, Bjorn¹ Eden, Patrik¹ Ohlsson, Mattias^1,2
[1] Lund Univ, Dept Astron & Theoret Phys, Lund, Sweden
[2] Halmstad Univ, Ctr Appl Intelligent Syst Res, Halmstad, Sweden
关键词: Deep learning; Autoencoder; Imputation; Missing data;
DOI : 10.1016/j.neucom.2019.07.065
来源: Elsevier
PDF


	文献评价指标
	下载次数：0次	浏览次数：0次

【 摘 要 】

【 授权许可】

【 预 览 】

【摘要】

【授权许可】

【预览】