Journal Article Details
IEEE Access
An Improved Imputation Method for Accurate Prediction of Imputed Dataset Based Radon Time Series
Masoud Alajmi [1], Adil Aslam Mir [2], Fahd N. Al-Wesabi [3], Fatih Vehbi Celebi [4], Lal Hussain [5], Muhammad Rafique [6], Anwer Mustafa Hilal [7], Ahmed S. Almasoud [8]
[1] Department of Computer Engineering, Ankara Yıldırım Beyazıt University, Ayvalı, Keçiören, Turkey
Keywords: Predictive mean matching; missingness; radon concentration; support vector machine; imputation; IBFI
DOI  :  10.1109/ACCESS.2022.3151892
Source: DOAJ
【 Abstract 】

This article evaluates a new methodology, imputation by feature importance (IBFI), in terms of how well its imputed datasets serve subsequent regression on soil radon gas concentration (SRGC) time-series data. The time-series data used for experimentation were collected over a fourteen-month period that included four seismic events. IBFI was found to be more efficient at imputing the missing patterns in the investigated time series than the traditionally used imputation methods, viz. mean, median, mode, predictive mean matching (PMM), and hot-deck imputation. IBFI was applied under several missingness mechanisms, namely missing not at random (MNAR), missing completely at random (MCAR), and missing at random (MAR), with missingness percentages ranging from 10% to 30%. The imputed datasets, 9 for each imputation method, were then used to predict the attribute of interest, radon concentration (RN), with the thoron, temperature, relative humidity, and pressure time series as independent attributes. A support vector machine (SVM) with a linear kernel was used as the learning algorithm, and its performance was evaluated according to how efficiently and without bias the missing values had been imputed. The statistical performance measures root mean squared log error (RMSLE), root mean square error (RMSE), mean squared error (MSE), and mean absolute percentage error (MAPE) were calculated to assess performance. The findings show that the IBFI-imputed dataset yields a better-fitted model: model generation and prediction on IBFI-imputed time series give more accurate predictions than on mean, median, mode, PMM, and hot-deck imputed time series. Furthermore, PMM and median imputed time series perform close to the IBFI imputed time series.
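
The abstract names four error measures but does not give their formulas. As a minimal sketch, assuming the standard textbook definitions, they could be computed as follows. The function names, the use of log(1 + x) in RMSLE, the percent scaling of MAPE, and the sample values are all assumptions made for illustration; they are not taken from the article.

```python
import math

def mse(y_true, y_pred):
    """Mean squared error: average of squared differences."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

def rmse(y_true, y_pred):
    """Root mean square error: square root of the MSE."""
    return math.sqrt(mse(y_true, y_pred))

def rmsle(y_true, y_pred):
    """Root mean squared log error, using log(1 + x) (assumed convention).
    All values must be greater than -1."""
    total = sum((math.log1p(p) - math.log1p(t)) ** 2
                for t, p in zip(y_true, y_pred))
    return math.sqrt(total / len(y_true))

def mape(y_true, y_pred):
    """Mean absolute percentage error, in percent (assumed convention).
    True values must be nonzero."""
    total = sum(abs((t - p) / t) for t, p in zip(y_true, y_pred))
    return 100.0 * total / len(y_true)

# Hypothetical values for illustration only; not data from the study.
observed = [100.0, 200.0, 400.0]
predicted = [110.0, 190.0, 400.0]

for name, fn in [("RMSLE", rmsle), ("RMSE", rmse), ("MSE", mse), ("MAPE", mape)]:
    print(f"{name}: {fn(observed, predicted):.4f}")
```

In a comparison like the one described, the same four measures would be computed for the model fitted on each imputed dataset, with lower values indicating a better fit.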

【 License 】

Unknown   
