期刊论文详细信息
PATTERN RECOGNITION 卷:72
Gaze latent support vector machine for image classification improved by weakly supervised region selection
Article
Wang, Xin1  Thome, Nicolas1,2  Cord, Matthieu1 
[1] UPMC Univ Paris 06, Sorbonne Univ, UMR 7606, LIP6, F-75005 Paris, France
[2] Conservatoire Natl Arts & Metiers CEDRIC, 292 Rue St Martin, F-75003 Paris, France
关键词: Weakly supervised learning;    Human gaze;    image classification;   
DOI  :  10.1016/j.patcog.2017.07.001
来源: Elsevier
PDF
【 摘 要 】

This paper deals with Weakly Supervised Learning (WSL), i.e. performing image classification by leveraging local information with models trained from global image labels. We propose a new WSL method which incorporates gaze features collected by an eye-tracker to guide the region selection policy. Our approach presents two appealing advantages: gaze features are cheap to collect, e.g. one order of magnitude faster than bounding boxes, and our method only requires gaze features during training, while being gaze free at test time. For this purpose, the training objective is enriched with a gaze loss, from which we derive a concave-convex upper bound, leading to an off-the-shelf CCCP optimization scheme. Extensive experiments are conducted to validate the effectiveness of the approach for WSL image classification on two public datasets with gaze annotation, i.e. PASCAL VOC 2012 action and POET. In addition, we publicly release a new food-related dataset, the Gaze-based UPMC Food dataset (UPMC-G20), which covers 20 food categories and 2,000 images. This dataset intends to promote the research in the food-related computer vision community. (C) 2017 Elsevier Ltd. All rights reserved.

【 授权许可】

Free   

【 预 览 】
附件列表
Files Size Format View
10_1016_j_patcog_2017_07_001.pdf 3606KB PDF download
  文献评价指标  
  下载次数:6次 浏览次数:0次