Dissertation details
Contributions to Effect Size Analysis with Large Scale Data.
Hsu, Ming-Chi; Zhu, Ji
University of Michigan
Keywords: dissimilarity among effect sizes; functional summary; correlation path; Statistics and Numeric Data; Science; Statistics
Others  :  https://deepblue.lib.umich.edu/bitstream/handle/2027.42/110392/mchsu_1.pdf?sequence=1&isAllowed=y
Switzerland | English
Source: The Illinois Digital Environment for Access to Learning and Scholarship
【 Abstract 】

Large and complex data sets are common in modern life. These data sets are mines of information, and statisticians are developing new techniques to extract information from them. This dissertation contributes statistical methods for exploring such challenging data sets.

The second chapter estimates the dissimilarity among effect sizes in a regression model. A natural summary is the ratio of the maximum magnitude to the minimum magnitude among the effects. Because this quantity is nonstandard, some standard techniques cannot be applied to it directly. We discuss procedures that improve the performance of point estimation and confidence intervals, and we apply them to the National Health and Nutrition Examination Survey (NHANES) from 2011 to 2012.

The third chapter investigates functional summaries that present a p by p covariance structure in an accessible and easily visualized form. The summaries reflect interpretable patterns in the data and are unaffected by relabeling of the variables. The proposed functional summaries allow us to visualize differences in the covariance structures of two data sets, even when they have different dimensions. Our summaries emphasize the degree to which each variable is predictable from the others, with a special focus on the number of variables required to predict another variable. We apply the functional summaries to two gene expression data sets, 108 normal heart tissue samples from the Cleveland Clinic Kaufman Center and 734 whole-blood RNA samples from the Estonian Biobank, to compare structures with different dimensions.

The fourth chapter studies a projection-based approach for exploring conditional correlation paths. We propose a graphical tool that enables us to explore the change in dependence structure from marginal correlations to partial correlations. The path is built by gradually adding information from the other variables until the partial correlations are reached. The proposed projection-based approach can also be applied to another type of conditional correlation matrix, one conditioned on linear statistics of the data, so that we can explore how the correlation matrices change as the value of a linear statistic varies. We apply the approach to a gene expression data set of 108 normal heart tissue samples from the Cleveland Clinic Kaufman Center.
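To make the two endpoints of such a correlation path concrete, the following minimal sketch computes marginal correlations and full partial correlations from a covariance matrix. It is only an illustration under stated assumptions: it uses the standard precision-matrix identity for partial correlations, not the dissertation's projection-based construction (which the abstract does not specify), and the function names and the toy covariance matrix `toy_cov` are hypothetical.

```python
import numpy as np


def correlation_from_covariance(cov):
    """Marginal correlations: rescale the covariance to a unit diagonal."""
    cov = np.asarray(cov, dtype=float)
    scale = np.sqrt(np.diag(cov))
    return cov / np.outer(scale, scale)


def partial_correlation_from_covariance(cov):
    """Partial correlation of each pair given all remaining variables.

    Uses the standard identity rho_ij|rest = -P_ij / sqrt(P_ii * P_jj),
    where P is the inverse of the covariance matrix.
    """
    precision = np.linalg.inv(np.asarray(cov, dtype=float))
    scale = np.sqrt(np.diag(precision))
    partial = -precision / np.outer(scale, scale)
    np.fill_diagonal(partial, 1.0)  # a variable is perfectly correlated with itself
    return partial


# Hypothetical 3-variable covariance with an AR(1) pattern (rho = 0.5);
# not data from the dissertation.
toy_cov = np.array([
    [1.00, 0.50, 0.25],
    [0.50, 1.00, 0.50],
    [0.25, 0.50, 1.00],
])

marginal = correlation_from_covariance(toy_cov)
partial = partial_correlation_from_covariance(toy_cov)
print(np.round(marginal, 3))
print(np.round(partial, 3))
```

In this toy example the first and third variables have a marginal correlation of 0.25, while their partial correlation given the second variable is essentially zero. That is the kind of change in dependence structure a path from marginal to partial correlations is meant to expose.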

【 Preview 】
Attachment list
Files | Size | Format | View
Contributions to Effect Size Analysis with Large Scale Data. | 2023 KB | PDF | download
  Document metrics
  Downloads: 26   Views: 54