PATTERN RECOGNITION | 卷:67 |
A new maximum relevance-minimum multicollinearity (MRmMC) method for feature selection and ranking | |
Article | |
Senawi, Azlyna1 Wei, Hua-Liang1,2 Billings, Stephen A.1,2 | |
[1] Univ Sheffield, Dept Automat Control & Syst Engn, Sheffield S1 3JD, S Yorkshire, England | |
[2] Univ Sheffield, INSIGNO Inst Silico Med, Sheffield S1 3JD, S Yorkshire, England | |
关键词: Dimensionality reduction; Feature selection; Classification; Correlation measure; Qualitative and quantitative variables; | |
DOI : 10.1016/j.patcog.2017.01.026 | |
来源: Elsevier | |
![]() |
【 摘 要 】
A substantial amount of datasets stored for various applications are often high dimensional with redundant and irrelevant features. Processing and analysing data under such circumstances is time consuming and makes it difficult to obtain efficient predictive models. There is a strong need to carry out analyses for high dimensional data in some lower dimensions, and one approach to achieve this is through feature selection. This paper presents a new relevancy-redundancy approach, called the maximum relevance minimum multicollinearity (MRmMC) method, for feature selection and ranking, which can overcome some shortcomings of existing criteria. In the proposed method, relevant features are measured by correlation characteristics based on conditional variance while redundancy elimination is achieved according to multiple correlation assessment using an orthogonal projection scheme. A series of experiments were conducted on eight datasets from the UCI Machine Learning Repository and results show that the proposed method performed reasonably well for feature subset selection. (C) 2017 Elsevier Ltd. All rights reserved.
【 授权许可】
Free
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
10_1016_j_patcog_2017_01_026.pdf | 2042KB | ![]() |