Symmetry | |
Local Importance Representation Convolutional Neural Network for Fine-Grained Image Classification | |
Yadong Yang1  Xiaofeng Wang1  Hengzheng Zhang1  | |
[1] College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China; | |
关键词: fine-grained image classification; B-CNN; LIR-CNN; super-pixel segmentation convolution; local importance representation convolution; channelwise convolution; | |
DOI : 10.3390/sym10100479 | |
来源: DOAJ |
【 摘 要 】
Compared with ordinary image classification tasks, fine-grained image classification is closer to real-life scenes. Its key point is how to find the local areas with sufficient discrimination and perform effective feature learning. Based on a bilinear convolutional neural network (B-CNN), this paper designs a local importance representation convolutional neural network (LIR-CNN) model, which can be divided into three parts. Firstly, the super-pixel segmentation convolution method is used for the input layer of the model. It allows the model to receive images of different sizes and fully considers the complex geometric deformation of the images. Then, we replaced the standard convolution of B-CNN with the proposed local importance representation convolution. It can score each local area of the image using learning to distinguish their importance. Finally, channelwise convolution is proposed and it plays an important role in balancing lightweight network and classification accuracy. Experimental results on the benchmark datasets (e.g., CUB-200-2011, FGVC-Aircraft, and Stanford Cars) showed that the LIR-CNN model had good performance in fine-grained image classification tasks.
【 授权许可】
Unknown