学位论文详细信息
Using P-Splines to Estimate Nonlinear Covariate Effects in Latent Factor Models.
latent factor model;P-spline;semiparametric method;EM algorithm;non-constant factor loading;overall exposure effect;Public Health;Statistics and Numeric Data;Health Sciences;Science;Biostatistics
Zhang, ZhenzhenBraun, Thomas M ;
University of Michigan
关键词: latent factor model;    P-spline;    semiparametric method;    EM algorithm;    non-constant factor loading;    overall exposure effect;    Public Health;    Statistics and Numeric Data;    Health Sciences;    Science;    Biostatistics;   
Others  :  https://deepblue.lib.umich.edu/bitstream/handle/2027.42/116762/zhzhang_1.pdf?sequence=2&isAllowed=y
瑞士|英语
来源: The Illinois Digital Environment for Access to Learning and Scholarship
PDF
【 摘 要 】

Latent factor models are useful for summarizing information among multiple outcomes. In this thesis I apply semiparametric methods based on P-splines to latent factor models in order to estimate and test non-constant factor loadings, as well as estimate and test the nonlinear relationship between an observed continuous predictor and multiple observed outcomes that measure a latent factor.In the first chapter, I develop a modeling strategy that estimates non-constant factor loadings as functions of multiple covariates. A highlight of my algorithm is the optimization of a type of generalized cross-validation criterion within each iteration of the EM algorithm for estimating the smoothing parameters of the splines. Through simulation studies I show the advantage of correctly estimating the non-constant factor loadings in reducing bias for the estimated factor score. I apply my model to studying the correlation among four highly correlated PM2.5 constituents.In the second chapter I examine the use of likelihood ratio test (LRT) in assessing whether a factor loading is constant. In order to take into account the estimation of smoothing parameters in my testing procedure, I use maximum likelihood approach to smooth the P-splines, which treats the spline coefficients as random and I test the variance of the spline coefficients. importance sampling to compute the likelihood. I use a data-driven chi-square mixture approximation as the null LRT distribution. The method is applied to estimating the underlying lead exposure represented by four types of lead measurements on mothers from the ELEMENT study.In the third chapter I use P-splines to estimate and test deviations of the latent factor mean from a linear trend. I also make the connection between my semiparametric latent factor model to a class of linear mixed models that estimate an overall exposure effect for multiple outcomes. My algorithm is based on standard linear mixed model and is implemented by adapting PROC MIXED from SAS into an iterative procedure. I apply my model to studying the lead exposure effect on children;;s behaviors as measured by the psychometric battery BASC-2.

【 预 览 】
附件列表
Files Size Format View
Using P-Splines to Estimate Nonlinear Covariate Effects in Latent Factor Models. 1310KB PDF download
  文献评价指标  
  下载次数:7次 浏览次数:22次