期刊论文详细信息
JOURNAL OF MULTIVARIATE ANALYSIS 卷:111
Pattern recognition based on canonical correlations in a high dimension low sample size context
Article
Tamatani, Mitsuru2  Koch, Inge3  Naito, Kanta1 
[1] Shimane Univ, Dept Math, Hamada, Shimane, Japan
[2] Shimane Univ, Grad Sch Sci & Engn, Hamada, Shimane, Japan
[3] Univ Adelaide, Sch Math Sci, Adelaide, SA 5005, Australia
关键词: Canonical Correlations;    Consistency;    High Dimension Low Sample Size;    Misclassification;    Naive Bayes rule;   
DOI  :  10.1016/j.jmva.2012.04.011
来源: Elsevier
PDF
【 摘 要 】

This paper is concerned with pattern recognition for 2-class problems in a High Dimension Low Sample Size (HDLSS) setting. The proposed method is based on canonical correlations between the predictors X and responses Y. The paper proposes a modified version of the canonical correlation matrix Sigma(-1/2)(X) Sigma(XY) Sigma(-1/2)(Y) which is suitable for discrimination with class labels Y in a HDLSS context. The modified canonical correlation matrix yields ranking vectors for variable selection, a discriminant direction and a rule which is essentially equivalent to the naive Bayes rule. The paper examines the asymptotic behavior of the ranking vectors and the discriminant direction and gives precise conditions for HDLSS consistency in terms of the growth rates of the dimension and sample size. The feature selection induced by the discriminant direction as ranking vector is shown to work efficiently in simulations and in applications to real HDLSS data. (C) 2012 Elsevier Inc. All rights reserved.

【 授权许可】

Free   

【 预 览 】
附件列表
Files Size Format View
10_1016_j_jmva_2012_04_011.pdf 427KB PDF download
  文献评价指标  
  下载次数:7次 浏览次数:4次