学位论文详细信息
New results in dimension reduction and model selection
Hlle;Ltsa;Manifold learning;Model selection criteria
Smith, Andrew Korb ; Industrial and Systems Engineering
University:Georgia Institute of Technology
Department:Industrial and Systems Engineering
关键词: Hlle;    Ltsa;    Manifold learning;    Model selection criteria;   
Others  :  https://smartech.gatech.edu/bitstream/1853/22586/1/smith_andrew_k_200805_phd.pdf
美国|英语
来源: SMARTech Repository
PDF
【 摘 要 】

Dimension reduction is a vital tool in many areas of applied statistics in which the dimensionality of the predictors can be large.In such cases, many statistical methods will fail or yield unsatisfactory results.However, many data sets of high dimensionality actually contain a much simpler, low-dimensional structure.Classical methods such as principal components analysis are able to detect linear structures very effectively, but fail in the presence of nonlinear structures.In the first part of this thesis, we investigate the asymptotic behavior of two nonlinear dimensionality reduction algorithms, LTSA and HLLE.In particular, we show that both algorithms, under suitable conditions, asymptotically recover the true generating coordinates up to an isometry.We also discuss the relative merits of the two algorithms, and the effects of the underlying probability distributions of the coordinates on their performance.Model selection is a fundamental problem in nearly all areas of applied statistics.In particular, a balance must be achieved between good in-sample performance and out-of-sample prediction.It is typically very easy to achieve good fit in the sample data, but empirically we often find that such models will generalize poorly.In the second part of the thesis, we propose a new procedure for the model selection problem which generalizes traditional methods.Our algorithm allows the combination of existing model selection criteria via a ranking procedure, leading to the creation of new criteria which are able to combine measures of in-sample fit and out-of-sample prediction performance into a single value.We then propose an algorithm which provably finds the optimal combination with a specified probability.We demonstrate through simulations that these new combined criteria can be substantially more powerful than any individual criterion.

【 预 览 】
附件列表
Files Size Format View
New results in dimension reduction and model selection 840KB PDF download
  文献评价指标  
  下载次数:11次 浏览次数:16次