会议论文详细信息
9th International Multidisciplinary Scientific and Research Conference "Modern Issues in Science and Technology" Workshop "Advanced Technologies in Aerospace, Mechanical and Automation Engineering"
Salient region detection in the task of visual question answering
自然科学;工业技术
Favorskaya, Margarita^1 ; Andreev, Vladimir^1 ; Popov, Aleksei^1
Reshetnev Siberian State University of Science and Technology, 31 Krasnoyarsky Rabochy ave., Krasnoyarsk
660037, Russia^1
关键词: Conventional approach;    Convolutional neural network;    High probability;    Human abilities;    Neural network application;    Question Answering;    Salient region detections;    Segmentation results;   
Others  :  https://iopscience.iop.org/article/10.1088/1757-899X/450/5/052017/pdf
DOI  :  10.1088/1757-899X/450/5/052017
来源: IOP
PDF
【 摘 要 】

Salient region detection in Visual Question Answering (VQA) is an attempt to simulate a human ability to quickly perceive a scene by selectively looking on image fragments instead of processing a whole scene. The conventional approach deals with a neural network application. However, the Convolutional Neural Networks (CNNs) have many disadvantages compared with traditional methods for salient region detection. We modified the basic algorithm of salient region detection for VQA task by selecting such image fragments, which have a high probability to be included in a questionnaire. The experiments have been conducted on images from MS-COCO dataset and provided good segmentation results.

【 预 览 】
附件列表
Files Size Format View
Salient region detection in the task of visual question answering 708KB PDF download
  文献评价指标  
  下载次数:23次 浏览次数:20次