期刊论文详细信息
Data Science and Engineering
Scalable Multi-grained Cross-modal Similarity Query with Interpretability
Derong Shen1  Mingdong Zhu2  Lixin Xu2  Xianfang Wang2 
[1] School of Computer Science & Engineering, Northeastern University, 110819, Shenyang, China;School of Computer Science & Technology, Henan Institute of Technology, 453703, Xinxiang, China;
关键词: Cross-modal;    Interpretability;    Multi-grained;    Similarity query ·Scalability;   
DOI  :  10.1007/s41019-021-00162-4
来源: Springer
PDF
【 摘 要 】

Cross-modal similarity query has become a highlighted research topic for managing multimodal datasets such as images and texts. Existing researches generally focus on query accuracy by designing complex deep neural network models and hardly consider query efficiency and interpretability simultaneously, which are vital properties of cross-modal semantic query processing system on large-scale datasets. In this work, we investigate multi-grained common semantic embedding representations of images and texts and integrate interpretable query index into the deep neural network by developing a novel Multi-grained Cross-modal Query with Interpretability (MCQI) framework. The main contributions are as follows: (1) By integrating coarse-grained and fine-grained semantic learning models, a multi-grained cross-modal query processing architecture is proposed to ensure the adaptability and generality of query processing. (2) In order to capture the latent semantic relation between images and texts, the framework combines LSTM and attention mode, which enhances query accuracy for the cross-modal query and constructs the foundation for interpretable query processing. (3) Index structure and corresponding nearest neighbor query algorithm are proposed to boost the efficiency of interpretable queries. (4) A distributed query algorithm is proposed to improve the scalability of our framework. Comparing with state-of-the-art methods on widely used cross-modal datasets, the experimental results show the effectiveness of our MCQI approach.

【 授权许可】

CC BY   

【 预 览 】
附件列表
Files Size Format View
RO202108129302032ZK.pdf 2003KB PDF download
  文献评价指标  
  下载次数:10次 浏览次数:8次