期刊论文详细信息
International Journal of Information Technology
Optimizing the Capacity of a Convolutional Neural Network for Image Segmentation and Pattern Recognition
Yalong Jiang ; Zheru Chi
关键词: CNN;    capsule network;    capacity optimization;    character recognition;    data augmentation;    semantic segmentation.;   
DOI  :  10.1999/1307-6892/10009594
学科分类:计算机应用
来源: World Academy of Science, Engineering and Technology (W A S E T)
PDF
【 摘 要 】

In this paper, we study the factors which determine the capacity of a Convolutional Neural Network (CNN) model and propose the ways to evaluate and adjust the capacity of a CNN model for best matching to a specific pattern recognition task. Firstly, a scheme is proposed to adjust the number of independent functional units within a CNN model to make it be better fitted to a task. Secondly, the number of independent functional units in the capsule network is adjusted to fit it to the training dataset. Thirdly, a method based on Bayesian GAN is proposed to enrich the variances in the current dataset to increase its complexity. Experimental results on the PASCAL VOC 2010 Person Part dataset and the MNIST dataset show that, in both conventional CNN models and capsule networks, the number of independent functional units is an important factor that determines the capacity of a network model. By adjusting the number of functional units, the capacity of a model can better match the complexity of a dataset.

【 授权许可】

Unknown   

【 预 览 】
附件列表
Files Size Format View
RO201910289037923ZK.pdf 438KB PDF download
  文献评价指标  
  下载次数:9次 浏览次数:16次