Sensors | |
Seeding and Harvest: A Framework for Unsupervised Feature Selection Problems | |
Gang Chen1  Yuanli Cai2  | |
[1] School of Electronic and Information Engineering, Xi'an Jiaotong University, No.28, Xianning West Road, Xi'an 710049, China; | |
关键词: feature selection; seeding and harvest; noise injection; | |
DOI : 10.3390/s130100292 | |
来源: mdpi | |
【 摘 要 】
Feature selection, also known as attribute selection, is the technique of selecting a subset of relevant features for building robust object models. It is becoming more and more important for large-scale sensors applications with AI capabilities. The core idea of this paper is derived from a straightforward and intuitive principle saying that, if a feature subset (pattern) has more representativeness, it should be more self-organized, and as a result it should be more insensitive to artificially seeded noise points. In the light of this heuristic finding, we established the whole set of theoretical principles, based on which we proposed a two-stage framework to evaluate the relative importance of feature subsets, called seeding and harvest (S&H for short). At the first stage, we inject a number of artificial noise points into the original dataset; then at the second stage, we resort to an outlier detector to identify them under various feature patterns. The more precisely the seeded points can be extracted under a particular feature pattern, the more valuable and important the corresponding feature pattern should be. Besides, we compared our method with several state-of-the-art feature selection methods on a number of real-life datasets. The experiment results significantly confirm that our method can accomplish feature reduction tasks with high accuracy as well as low computing complexity.
【 授权许可】
CC BY
© 2013 by the authors; licensee MDPI, Basel, Switzerland.
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
RO202003190040048ZK.pdf | 925KB | download |