期刊论文详细信息
PATTERN RECOGNITION 卷:67
A new maximum relevance-minimum multicollinearity (MRmMC) method for feature selection and ranking
Article
Senawi, Azlyna1  Wei, Hua-Liang1,2  Billings, Stephen A.1,2 
[1] Univ Sheffield, Dept Automat Control & Syst Engn, Sheffield S1 3JD, S Yorkshire, England
[2] Univ Sheffield, INSIGNO Inst Silico Med, Sheffield S1 3JD, S Yorkshire, England
关键词: Dimensionality reduction;    Feature selection;    Classification;    Correlation measure;    Qualitative and quantitative variables;   
DOI  :  10.1016/j.patcog.2017.01.026
来源: Elsevier
PDF
【 摘 要 】

A substantial amount of datasets stored for various applications are often high dimensional with redundant and irrelevant features. Processing and analysing data under such circumstances is time consuming and makes it difficult to obtain efficient predictive models. There is a strong need to carry out analyses for high dimensional data in some lower dimensions, and one approach to achieve this is through feature selection. This paper presents a new relevancy-redundancy approach, called the maximum relevance minimum multicollinearity (MRmMC) method, for feature selection and ranking, which can overcome some shortcomings of existing criteria. In the proposed method, relevant features are measured by correlation characteristics based on conditional variance while redundancy elimination is achieved according to multiple correlation assessment using an orthogonal projection scheme. A series of experiments were conducted on eight datasets from the UCI Machine Learning Repository and results show that the proposed method performed reasonably well for feature subset selection. (C) 2017 Elsevier Ltd. All rights reserved.

【 授权许可】

Free   

【 预 览 】
附件列表
Files Size Format View
10_1016_j_patcog_2017_01_026.pdf 2042KB PDF download
  文献评价指标  
  下载次数:0次 浏览次数:0次