PATTERN RECOGNITION | 卷:47 |
Good recognition is non-metric | |
Article | |
Scheirer, Walter J.1,2,6  Wilber, Michael J.3  Eckmann, Michael4  Boult, Terrance E.5,6  | |
[1] Harvard Univ, Sch Engn & Appl Sci, Dept Mol & Cellular Biol, Cambridge, MA 02138 USA | |
[2] Harvard Univ, Ctr Brain Sci, Cambridge, MA 02138 USA | |
[3] Cornell Univ, Ithaca, NY 14853 USA | |
[4] Skidmore Coll, Dept Math & Comp Sci, Saratoga Springs, NY 12866 USA | |
[5] Securics Inc, Colorado Springs, CO 80918 USA | |
[6] Univ Colorado, Colorado Springs, CO 80933 USA | |
关键词: Machine learning; Metric learning; Recognition; Computer vision; Face recognition; Object recognition; | |
DOI : 10.1016/j.patcog.2014.02.018 | |
来源: Elsevier | |
【 摘 要 】
Recognition is the fundamental task of visual cognition, yet how to formalize the general recognition problem for computer vision remains an open issue. The problem is sometimes reduced to the simplest case of recognizing matching pairs, often structured to allow for metric constraints. However, visual recognition is broader than just pair-matching: what we learn and how we learn it has important implications for effective algorithms. In this review paper, we reconsider the assumption of recognition as a pair-matching test, and introduce a new formal definition that captures the broader context of the problem. Through a meta-analysis and an experimental assessment of the top algorithms on popular data sets, we gain a sense of how often metric properties are violated by recognition algorithms. By studying these violations, useful insights come to light: we make the case for local distances and systems that leverage outside information to solve the general recognition problem. (C) 2014 Elsevier Ltd. All rights reserved.
【 授权许可】
Free
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
10_1016_j_patcog_2014_02_018.pdf | 1071KB | download |