NEUROCOMPUTING | 卷:276 |
Handling missing data in multivariate time series using a vector autoregressive model-imputation (VAR-IM) algorithm | |
Article | |
Bashir, Faraj1  Wei, Hua-Liang2,3  | |
[1] Univ Sheffield, Dept Automat Control & Syst Engn, Mapping St, Sheffield S1 4DT, S Yorkshire, England | |
[2] Univ Sheffield, Dept Automat Control & Syst Engn, Mapping St, Sheffield S1 3JD, S Yorkshire, England | |
[3] Univ Sheffield, INSIGNEO Inst Silico Med, Mapping St, Sheffield S1 3JD, S Yorkshire, England | |
关键词: Missing data; EM algorithm; VAR model; ECG; | |
DOI : 10.1016/j.neucom.2017.03.097 | |
来源: Elsevier | |
【 摘 要 】
Imputing missing data from a multivariate time series dataset remains a challenging problem. There is an abundance of research on using various techniques to impute missing, biased, or corrupted values to a dataset. While a great amount of work has been done in this field, most imputing methodologies are centered about a specific application, typically involving static data analysis and simple time series modelling. However, these approaches fall short of desired goals when the data originates from a multivariate time series. The objective of this paper is to introduce a new algorithm for handling missing data from multivariate time series datasets. This new approach is based on a vector autoregressive (VAR) model by combining an expectation and minimization (EM) algorithm with the prediction error minimization (PEM) method. The new algorithm is called a vector autoregressive imputation method (VAR-IM). A description of the algorithm is presented and a case study was accomplished using the VAR-IM. The case study was applied to a real-world data set involving electrocardiogram (ECG) data. The VAR-IM method was compared with both traditional methods list wise deletion and linear regression substitution; and modern methods Multivariate Auto-Regressive State-Space (MARSS) and expectation maximization algorithm (EM). Generally, the VAR-IM method achieved significant improvement of the imputation tasks as compared with the other two methods. Although an improvement, a summary of the limitations and restrictions when using VAR-IM is presented. (c) 2017 Elsevier B.V. All rights reserved.
【 授权许可】
Free
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
10_1016_j_neucom_2017_03_097.pdf | 1463KB | download |