Thesis details
Recognition using visual phrases
Sadeghi, Mohammad Amin; Forsyth, David A.
Keywords: Visual Phrase; Phrasal Recognition; Visual Composites; Object Recognition
Others  :  https://www.ideals.illinois.edu/bitstream/handle/2142/32060/Sadeghi_Mohammad_Amin.pdf?sequence=1&isAllowed=y
United States | English
Source: The Illinois Digital Environment for Access to Learning and Scholarship
【 Abstract 】

In this thesis I introduce visual phrases, complex visual composites like "a person riding a horse". Visual phrases often display significantly reduced visual complexity compared to their component objects, because the appearance of those objects can change profoundly when they participate in relations. I introduce a dataset suitable for phrasal recognition that uses familiar PASCAL object categories, and demonstrate significant experimental gains resulting from exploiting visual phrases. I show that a visual phrase detector significantly outperforms a baseline which detects component objects and reasons about relations, even though visual phrase training sets tend to be smaller than those for objects. I argue that any multi-class detection system must decode detector outputs to produce final results; this is usually done with non-maximum suppression. I describe a novel decoding procedure that can account accurately for local context without solving difficult inference problems. I show this decoding procedure outperforms the state of the art. Finally, I show that decoding a combination of phrasal and object detectors produces real improvements in detector results.

Attachments
Files Size Format View
Recognition using visual phrases 12045KB PDF download