期刊论文

【摘要】

Image captioning has gradually gained attention in the field of artificial intelligence and become an interesting and challenging task for image understanding. It needs to identify important objects in images, extract attributes, tell relationships, and help the machine generate human-like descriptions. Recent works in deep neural networks have greatly improved the performance of image caption models. However, machines are still unable to imitate the way humans think, talk and communicate, so image captioning remains an ongoing task. It is thus very important to keep up with the latest research and results in the field of image captioning whereas publications on this topic are numerous. Our work aims to help researchers to have a macro-level understanding of image captioning from four aspects: spatial-temporal distribution characteristics, collaborative networks, trends in subject research, and historical evolutionary path. We employ scientometric visualization methods to achieve this goal. The results show that China has published the largest amount of publications in image captioning, but the United States has the greatest impact on research in this area. Besides, thirteen academic groups are identified in the field of image description, with institutions such as Microsoft, Google, Australian National University, and Georgia Institute of Technology being the most prominent research institutions. Meanwhile, we find that evaluation methods, datasets, novel image captioning models based on generative adversarial networks, reinforcement learning, and Transformer, as well as remote sensing image captioning, are the new research trends. Lastly, we conclude that image captioning research has gone through three major development stages from 2010 to 2020, and on this basis, we propose a more comprehensive taxonomy of image captioning.

【授权许可】

Unknown

IEEE Access
A Scientometric Visualization Analysis of Image Captioning Research From 2010 to 2020

Xiaoqiang Cheng¹ Kai Hu² Qing Luo³ Huayi Wu⁴ Wenxuan Liu⁴
[1] Faculty of Resources and Environmental Science, Hubei University, Wuhan, China;Key Laboratory of Advanced Process Control for Light Industry, Ministry of Education, Jiangnan University, Wuxi, China;School of Mathematics and Physics, Wuhan Institute of Technology, Wuhan, China;State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan, China;
关键词: Image captioning; image description generation; scientometric analysis; visualization;
DOI : 10.1109/ACCESS.2021.3129782
来源: DOAJ


	文献评价指标
	下载次数：0次	浏览次数：0次

【 摘 要 】

【 授权许可】

【摘要】

【授权许可】