学位论文详细信息
Multimodal machine translation
multimodal machine translation;neural machine translation;multi-task learning;image captioning
Dave, Mihika ; Hockenmaier ; Julia
关键词: multimodal machine translation;    neural machine translation;    multi-task learning;    image captioning;   
Others  :  https://www.ideals.illinois.edu/bitstream/handle/2142/101374/DAVE-THESIS-2018.pdf?sequence=1&isAllowed=y
美国|英语
来源: The Illinois Digital Environment for Access to Learning and Scholarship
PDF
【 摘 要 】

Over the past few years, there has been a lot of progress being made in machine translation through deep learning networks. But there is relatively lesser progress made in using images to catalyze the translation tasks. In this study, we explore various models to incorporate the image features in the machine translation models. We start with a monomodal translation model which uses only textual features. We extend this model to develop the multimodal system which incorporates the visual features related to the source sentence. We also propose a multitask system which uses image captioning task to aid the translation task. Our models are tested on multiple datasets using the automatic evaluation metrics like METEOR and BLEU. The experiments show that the proposed models outperform the text-only baseline model.

【 预 览 】
附件列表
Files Size Format View
Multimodal machine translation 10274KB PDF download
  文献评价指标  
  下载次数:7次 浏览次数:6次