In this thesis I introduce visual phrases, complex visual composites like ``a person riding a horse''. Visual phrases often display significantly reduced visual complexity compared to their component objects, because the appearance of those objects can change profoundly when they participate in relations. I introduce a dataset suitable for phrasal recognition that uses familiar PASCAL object categories, and demonstrate significant experimental gains from exploiting visual phrases. I show that a visual phrase detector significantly outperforms a baseline that detects component objects and then reasons about relations, even though visual phrase training sets tend to be smaller than those for objects. I argue that any multi-class detection system must decode detector outputs to produce final results; this is usually done with non-maximum suppression. I describe a novel decoding procedure that accurately accounts for local context without solving difficult inference problems, and I show that it outperforms the state of the art. Finally, I show that decoding a combination of phrasal and object detectors produces real improvements in detection results.
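For reference, the conventional decoding step named above is greedy non-maximum suppression. The sketch below is a minimal illustration of that standard baseline, not the decoding procedure proposed in this thesis; the box format (x1, y1, x2, y2), the 0.5 overlap threshold, and the function names are illustrative assumptions.

```python
def iou(a, b):
    """Intersection-over-union of two axis-aligned boxes (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter) if inter > 0 else 0.0

def greedy_nms(detections, overlap_thresh=0.5):
    """Standard greedy non-maximum suppression (illustrative baseline only).

    detections: list of (score, box) pairs; returns the detections kept.
    """
    kept = []
    # Visit detections in order of decreasing confidence.
    for score, box in sorted(detections, key=lambda d: d[0], reverse=True):
        # Keep a detection only if it does not overlap a stronger kept one.
        if all(iou(box, kept_box) < overlap_thresh for _, kept_box in kept):
            kept.append((score, box))
    return kept
```

Because this greedy rule considers only pairwise overlap with stronger detections, it ignores the local context that the decoding procedure described in this thesis is designed to exploit.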