学位论文详细信息
Modifying sparse coding to model imbalanced datasets
Sparse coding;Imbalanced data;Machine learning
Whitaker, Bradley M. ; Anderson, David V. Electrical and Computer Engineering Rozell, Christopher J. Romberg, Justin K. Li, Wing Clifford, Gari D. ; Anderson, David V.
University:Georgia Institute of Technology
Department:Electrical and Computer Engineering
关键词: Sparse coding;    Imbalanced data;    Machine learning;   
Others  :  https://smartech.gatech.edu/bitstream/1853/59919/1/WHITAKER-DISSERTATION-2018.pdf
美国|英语
来源: SMARTech Repository
PDF
【 摘 要 】

The objective of this research is to explore the use of sparse coding as a tool for unsupervised feature learning to more effectively model imbalanced datasets. Traditional sparse coding dictionaries are learned by minimizing the average approximation error between a vector and its sparse decomposition. As such, these dictionaries may overlook important features that occur infrequently in the data. Without these features, it may be difficult to accurately classify between classes if one or more classes are not well-represented in the training data. To overcome this problem, this work explores novel modifications to the sparse coding dictionary learning framework that encourage dictionaries to learn anomalous features. Sparse coding also inherently assumes that a vector can be represented as a sparse linear combination of a feature set. This work addresses the ability of sparse coding to learn a representative dictionary when the underlying data has a nonlinear sparse structure. Finally, this work illustrates one benefit of improved signal modeling by utilizing sparse coding in three imbalanced classification tasks.

【 预 览 】
附件列表
Files Size Format View
Modifying sparse coding to model imbalanced datasets 955KB PDF download
  文献评价指标  
  下载次数:8次 浏览次数:7次