A Survey of Dimension Reduction Techniques
Fodor, I K
Lawrence Livermore National Laboratory
Keywords: Data Analysis; Storage; Dimensions; Remote Sensing; 99 General And Miscellaneous//Mathematics, Computing, And Information Science
DOI: 10.2172/15002155; RP-ID: UCRL-ID-148494; RP-ID: W-7405-ENG-48; RP-ID: 15002155
United States | English
Source: UNT Digital Library
[Abstract]
Advances in data collection and storage capabilities during the past decades have led to an information overload in most sciences. Researchers working in domains as diverse as engineering, astronomy, biology, remote sensing, economics, and consumer transactions face larger and larger observations and simulations on a daily basis. Such datasets, in contrast with the smaller, more traditional datasets that have been studied extensively in the past, present new challenges in data analysis. Traditional statistical methods break down partly because of the increase in the number of observations, but mostly because of the increase in the number of variables associated with each observation. The dimension of the data is the number of variables that are measured on each observation. High-dimensional datasets present many mathematical challenges as well as some opportunities, and are bound to give rise to new theoretical developments. One of the problems with high-dimensional datasets is that, in many cases, not all the measured variables are "important" for understanding the underlying phenomena of interest. While certain computationally expensive novel methods can construct predictive models with high accuracy from high-dimensional data, it is still of interest in many applications to reduce the dimension of the original data prior to any modeling. In this paper, we describe several dimension reduction methods.
[Preview]
Files | Size | Format | View |
---|---|---|---|
15002155.pdf | 1360KB | PDF | download |