Journal of Computer Science | |
Prototype-Based Sample Selection for Active Hashing | Science Publications | |
Cheong Hee Park1  | |
关键词: Active Hashing; Approximate Nearest Neighbors (ANN) Search; Hierarchical Clustering; Prototype-Based Sample Selection; Semi-Supervised Hashing; | |
DOI : 10.3844/jcssp.2015.839.844 | |
学科分类:计算机科学(综合) | |
来源: Science Publications | |
【 摘 要 】
Severalhashing-based methods for Approximate Nearest Neighbors (ANN) search in a largedata set have been proposed recently. In particular, semi-supervised hashingutilizes semantic similarity given for a small fraction of pairwise data samplesand active hashing aims to improve the performance for ANN search by relying onan expert for the labeling of the mostinformative points. In this study, we present an active hashing method byprototype-based sample selection. Knowing semantic similarities between clusterprototypes can help extracting relations among the points in the correspondingclusters. For expert labeling, we select prototypes from clusters which do notcontain any data points with labeled information so that all areas canbe covered effectively. Experimental results demonstrate that the proposedactive hashing method improves the performance for ANN search.
【 授权许可】
Unknown
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
RO201911300878171ZK.pdf | 391KB | download |