期刊论文

【摘要】

Linear discriminant analysis (LDA) is a well-known classification technique that enjoyed great success in practical applications. Despite its effectiveness for traditional low-dimensional problems, extensions of LDA are necessary in order to classify high dimensional data. Many variants of LDA have been proposed in the literature. However, most of these methods do not fully incorporate the structure information among predictors when such information is available. In this paper, we introduce a new high-dimensional LDA technique, namely graph-based sparse LDA (GSLDA), that utilizes the graph structure among the features. In particular, we use the regularized regression formulation for penalized LDA techniques, and propose to impose a structure-based sparse penalty on the discriminant vector beta. The graph structure can be either given or estimated from the training data. Moreover, we explore the relationship between the within-class feature structure and the overall feature structure. Based on this relationship, we further propose a variant of our proposed GSLDA to utilize effectively unlabeled data, which can be abundant in the semi-supervised learning setting. With the new regularization, we can obtain a sparse estimate of beta and more accurate and interpretable classifiers than many existing methods. Both the selection consistency of beta estimation and the convergence rate of the classifier are established, and the resulting classifier has an asymptotic Bayes error rate. Finally, we demonstrate the competitive performance of the proposed GSLDA on both simulated and real data studies. (C) 2018 Elsevier Inc. All rights reserved.

【授权许可】

Free

【预览】

附件列表
Files	Size	Format	View
10_1016_j_jmva_2018_12_007.pdf	3617KB	PDF	download

JOURNAL OF MULTIVARIATE ANALYSIS	卷:171
Graph-based sparse linear discriminant analysis for high-dimensional classification
Article
Liu, Jianyu¹ Yu, Guan² Liu, Yufeng^1,3,4,5
[1] Univ N Carolina, Dept Stat & Operat Res, Chapel Hill, NC 27599 USA
[2] SUNY Buffalo, Dept Biostat, Buffalo, NY 14214 USA
[3] Univ N Carolina, Dept Genet, Chapel Hill, NC 27599 USA
[4] Univ N Carolina, Dept Biostat, Chapel Hill, NC 27599 USA
[5] Univ N Carolina, Carolina Ctr Genome Sci, Chapel Hill, NC 27599 USA
关键词: Feature structure; Gaussian graphical models; Regularization; Undirected graph;
DOI : 10.1016/j.jmva.2018.12.007
来源: Elsevier
PDF


	文献评价指标
	下载次数：1次	浏览次数：0次

【 摘 要 】

【 授权许可】

【 预 览 】

【摘要】

【授权许可】

【预览】