Technical Report Details
A Survey of Dimension Reduction Techniques
Fodor, I K
Lawrence Livermore National Laboratory
Keywords: Data Analysis; Storage; Dimensions; Remote Sensing; 99 General And Miscellaneous//Mathematics, Computing, And Information Science
DOI  :  10.2172/15002155
RP-ID  :  UCRL-ID-148494
RP-ID  :  W-7405-ENG-48
RP-ID  :  15002155
United States | English
Source: UNT Digital Library
【 Abstract 】

Advances in data collection and storage capabilities during the past decades have led to an information overload in most sciences. Researchers working in domains as diverse as engineering, astronomy, biology, remote sensing, economics, and consumer transactions face larger and larger observations and simulations on a daily basis. Such datasets, in contrast with the smaller, more traditional datasets that have been studied extensively in the past, present new challenges in data analysis. Traditional statistical methods break down partly because of the increase in the number of observations, but mostly because of the increase in the number of variables associated with each observation. The dimension of the data is the number of variables that are measured on each observation. High-dimensional datasets present many mathematical challenges as well as some opportunities, and are bound to give rise to new theoretical developments. One of the problems with high-dimensional datasets is that, in many cases, not all the measured variables are "important" for understanding the underlying phenomena of interest. While certain computationally expensive novel methods can construct predictive models with high accuracy from high-dimensional data, it is still of interest in many applications to reduce the dimension of the original data prior to any modeling. In this paper, we describe several dimension reduction methods.

【 Preview 】
Attachments
Files Size Format View
15002155.pdf 1360KB PDF download