期刊论文详细信息
NEUROCOMPUTING 卷:423
Combining pretrained CNN feature extractors to enhance clustering of complex natural images
Article
Guerin, Joris1  Thiery, Stephane2  Nyiri, Eric2  Gibaru, Olivier2  Boots, Byron3 
[1] Univ Fed Rio Grande do Norte, Natal, RN, Brazil
[2] Ecole Natl Arts & Metiers ParisTech, Lille, France
[3] Univ Washington, Seattle, WA 98195 USA
关键词: Image clustering;    Transfer clustering;    Multi-View clustering;   
DOI  :  10.1016/j.neucom.2020.10.068
来源: Elsevier
PDF
【 摘 要 】

Recently, a common starting point for solving complex unsupervised image classification tasks is to use generic features, extracted with deep Convolutional Neural Networks (CNN) pretrained on a large and versatile dataset (ImageNet). However, in most research, the CNN architecture for feature extraction is chosen arbitrarily, without justification. This paper aims at providing insight on the use of pretrained CNN features for image clustering (IC). First, extensive experiments are conducted and show that, for a given dataset, the choice of the CNN architecture for feature extraction has a huge impact on the final clustering. These experiments also demonstrate that proper extractor selection for a given IC task is difficult. To solve this issue, we propose to rephrase the IC problem as a multi-view clustering (MVC) problem that considers features extracted from different architectures as different views of the same data. This approach is based on the assumption that information contained in the different CNN may be complementary, even when pretrained on the same data. We then propose a multi-input neural network architecture that is trained end-to-end to solve the MVC problem effectively. This approach is tested on nine natural image datasets, and produces state-of-the-art results for IC. (c) 2020 Elsevier B.V. All rights reserved.

【 授权许可】

Free   

【 预 览 】
附件列表
Files Size Format View
10_1016_j_neucom_2020_10_068.pdf 2217KB PDF download
  文献评价指标  
  下载次数:5次 浏览次数:0次