2018 5th International Conference on Advanced Composite Materials and Manufacturing Engineering | |
A Hybrid Feature Selection Method for Software Defect Prediction | |
Jia, Lina^1 | |
School of Information Science and Engineering, Zaozhuang University, Zaozhuang, China^1 | |
关键词: Hybrid feature selections; Information gain; Pearson correlation coefficients; Prediction model; Quality problems; Software defect prediction; Software modules; Software quality assurance; | |
Others : https://iopscience.iop.org/article/10.1088/1757-899X/394/3/032035/pdf DOI : 10.1088/1757-899X/394/3/032035 |
|
来源: IOP | |
【 摘 要 】
Software Defect Prediction (SDP) is one of the important ways of software quality assurance, which uses the metric data to predict whether software module is defect. The quality of data influences the perfection of the prediction model. The high latitude containing some unnecessary features is one of the quality problem that dataset. To solve this problem, we proposed a hybrid feature selection (HFS) method combined different feature sorting technology. Firstly, we calculate the values of each feature include chi-squared (cs), Information gain (IG) and Pearson Correlation coefficient, respectively. Secondly, we sort the features based on the ranking of the three values to select features. Finally, we use the random forest to build the model. In order to validity the approach, we did experiments on 5 datasets in NASA. The result shows that our approach can select a smaller subset of features to improve the preformation in F-measure.
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
A Hybrid Feature Selection Method for Software Defect Prediction | 611KB | download |