学位论文详细信息
Image and video object selection
Object selection;Computer vision;Deep learning;Image segmentation;Video segmentation
Xu, Ning
关键词: Object selection;    Computer vision;    Deep learning;    Image segmentation;    Video segmentation;   
Others  :  https://www.ideals.illinois.edu/bitstream/handle/2142/99515/XU-DISSERTATION-2017.pdf?sequence=1&isAllowed=y
美国|英语
来源: The Illinois Digital Environment for Access to Learning and Scholarship
PDF
【 摘 要 】

Image and video object selection present fundamental research problems in the computer vision field and have many practical applications. They are important technologies in image and video editing, film production, robotics and autonomous driving etc. Previous methods have serious limitations for those tasks for several reasons. First, most of them use some low-level, handcrafted features which are not optimal. Second, they also lack the high-level understanding of "objectness" and semantics. Last but not the least, their generalization ability on unconstrained scenarios is very poor. Recently, deep learning has become the dominant method for computer vision tasks including recognition and detection since it cannot only learn good feature representation in an end-to-end manner but it is also effective at capturing the high-level semantics. However, its exploration in image and video object selection is stillimpoverished. Therefore, in this thesis we propose several novel deep-learning based methods to tackle the limitations in image and video object selection. Our algorithms are easy to understand and effective. Experimental results clearly demonstrate the superiority of our algorithms over previous methods. Some highlights include the following: (1) Our interactive segmentation algorithm is the first deep-learning based algorithm and achieves the state-of-the-art results on both small-scale and large-scale benchmarks. (2) Our rectangle-based algorithm novelly transforms rectangle inputs to attention-like distance maps and achieves robust performance for sloppy user selections or misplaced detection boxes.(3) Our image matting algorithm is the first to demonstrate the feasibility of learning an alpha matte end-to-end given an image and trimap. It also achieves state-of-the-art results on image matting and video matting benchmarks. (4) Our video object segmentation method combines CNN network with RNN memory cells to learn both good image feature representation and the temporal-spatial coherence.

【 预 览 】
附件列表
Files Size Format View
Image and video object selection 11516KB PDF download
  文献评价指标  
  下载次数:4次 浏览次数:12次