Journal article

【Abstract】

Hubness is a recently described aspect of the curse of dimensionality inherent to nearest-neighbor methods. This paper describes a new approach for exploiting the hubness phenomenon in k-nearest neighbor classification. We argue that some neighbor occurrences carry more information than others, by virtue of being less frequent events. This observation is related to the hubness phenomenon, and we explore how it affects high-dimensional k-nearest neighbor classification. We propose a new algorithm, Hubness Information k-Nearest Neighbor (HIKNN), which introduces k-occurrence informativeness into the hubness-aware k-nearest neighbor voting framework. The algorithm overcomes some of the issues with previous hubness-aware approaches, as shown by an extensive evaluation on several types of high-dimensional data.
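
The idea that rare neighbor occurrences are more informative can be illustrated with a small sketch. The code below is not the HIKNN algorithm itself: it only counts how often each point appears in the k-nearest-neighbor lists of the other points (its k-occurrence count) and turns that count into a self-information value. The function names, the Euclidean metric, and the add-one smoothing of the occurrence probability are assumptions made for this illustration and are not taken from the abstract.

```python
import math


def k_nearest(points, i, k):
    """Indices of the k points closest to points[i] (Euclidean), excluding i itself."""
    dists = [(math.dist(points[i], p), j) for j, p in enumerate(points) if j != i]
    dists.sort()  # ties are broken by the lower index
    return [j for _, j in dists[:k]]


def k_occurrences(points, k):
    """For each point, count how many k-NN lists of other points it appears in."""
    counts = [0] * len(points)
    for i in range(len(points)):
        for j in k_nearest(points, i, k):
            counts[j] += 1
    return counts


def occurrence_information(counts):
    """Self-information (in nats) of observing each point as a neighbor.

    Assumption for this sketch: the occurrence probability is estimated as
    (count + 1) / (n + 1), which keeps the logarithm finite for points
    that never occur as neighbors.
    """
    n = len(counts)
    return [math.log((n + 1) / (c + 1)) for c in counts]


# Hypothetical toy data: the first point is a small "hub", the last an outlier.
points = [(0, 0), (1, 0), (-1.5, 0), (0, 2), (10, 0)]
counts = k_occurrences(points, k=1)
info = occurrence_information(counts)
```

Under this estimate, a frequently occurring point (a hub) receives a low information value and a rarely occurring point a high one. A voting scheme could then weight a neighbor's vote by such a value, which is one plausible reading of "k-occurrence informativeness"; the weighting actually used by HIKNN is defined in the paper, not here.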

【License】

CC BY-NC-ND

【Preview】

Attachments
Files	Size	Format	View
RO201904026037133ZK.pdf	825KB	PDF	download

Computer Science and Information Systems
Nearest Neighbor Voting in High Dimensional Data: Learning from Past Occurrences

Nenad Tomasev¹
[1] Artificial Intelligence Laboratory, Jozef Stefan Institute and Jozef Stefan International Postgraduate School
Keywords: k-nearest neighbor; curse of dimensionality; hubness; neighbor occurrence models; self-information; fuzzy; voting
DOI: 10.2298/CSIS111211014T
Subject classification: Social Sciences, Humanities and Arts (General)
Source: Computer Science and Information Systems


Article metrics
Downloads: 5	Views: 6
