High Dimensional Correlation Networks And Their Applications.
Big Data;High Dimensional Data;Correlation Analysis;Time Series Analysis;Covariance Estimation;Dimensionality Reduction;Electrical Engineering;Engineering;Electrical Engineering: Systems
Analysis of interactions between variables in a large data set has recently attracted special attention in the context of high dimensional multivariate statistical analysis. Variable interactions play a role in many inference tasks, such as, classification, clustering, estimation, and prediction. This thesis focuses on the discovery of correlation and partial correlation structures as well as their applications in high dimensional data analysis and inference. The thesis considers problems of screening correlationand partial correlation networks by thresholding the sample correlation or the sample partial correlation matrix. The selection of the threshold is guided by our high dimensional asymptotic theory for screening such networks. Scalable methods of edge and hub screening are developed for applications in spatio-temporal analysis of time series, variable selection for linear prediction, and support recovery. The proposed methods are specifically designed for very high dimensional data with limited number of samples. Moreover, the correlation screening theory developed in this thesis provides high dimensional family-wise error rates on false discoveries.
【 预 览 】
附件列表
Files
Size
Format
View
High Dimensional Correlation Networks And Their Applications.