期刊论文

【摘要】

Recently, convolutional neural network (CNN) has led to significant improvement in the field of computer vision, especially the improvement of the accuracy and speed of semantic segmentation tasks, which greatly improved robot scene perception. In this article, we propose a multilevel feature fusion dilated convolution network (Refine-DeepLab). By improving the space pyramid pooling structure, we propose a multiscale hybrid dilated convolution module, which captures the rich context information and effectively alleviates the contradiction between the receptive field size and the dilated convolution operation. At the same time, the high-level semantic information and low-level semantic information obtained through multi-level and multi-scale feature extraction can effectively improve the capture of global information and improve the performance of large-scale target segmentation. The encoder–decoder gradually recovers spatial information while capturing high-level semantic information, resulting in sharper object boundaries. Extensive experiments verify the effectiveness of our proposed Refine-DeepLab model, evaluate our approaches thoroughly on the PASCAL VOC 2012 data set without MS COCO data set pretraining, and achieve a state-of-art result of 81.73% mean interaction-over-union in the validate set.

【授权许可】

CC BY

【预览】

附件列表
Files	Size	Format	View
RO202108130004933ZK.pdf	2306KB	PDF	download

International Journal of Advanced Robotic Systems
Multilevel feature fusion dilated convolutional network for semantic segmentation
article
Tao Ku¹ Qirui Yang¹ Hao Zhang¹
[1] Shenyang Institute of Automation, Chinese Academy of Sciences;Institutes for Robotics and Intelligent Manufacturing, Chinese Academy of Sciences;University of Chinese Academy of Sciences
关键词: Semantic segmentation; convolutional neural network; deep learning; computer vision; robot vision;
DOI : 10.1177/17298814211007665
学科分类：社会科学、人文和艺术（综合）
来源: InTech
PDF


	文献评价指标
	下载次数：14次	浏览次数：0次

【 摘 要 】

【 授权许可】

【 预 览 】

【摘要】

【授权许可】

【预览】