期刊论文详细信息
Information
Multimodal Sequential Fashion Attribute Prediction
HasanSait Arslan1  Kairit Sirts1  Mark Fishel1  Gholamreza Anbarjafari2 
[1] NLP Group, Institute of Computer Science, University of Tartu, 50090 Tartu, Estonia;iCV Lab, Institute of Technology, University of Tartu, 50090 Tartu, Estonia;
关键词: fashion e-commerce;    product attribute prediction;    multimodal classification;    sequential prediction;    cnn;    rnn;   
DOI  :  10.3390/info10100308
来源: DOAJ
【 摘 要 】

We address multimodal product attribute prediction of fashion items based on product images and titles. The product attributes, such as type, sub-type, cut or fit, are in a chain format, with previous attribute values constraining the values of the next attributes. We propose to address this task with a sequential prediction model that can learn to capture the dependencies between the different attribute values in the chain. Our experiments on three product datasets show that the sequential model outperforms two non-sequential baselines on all experimental datasets. Compared to other models, the sequential model is also better able to generate sequences of attribute chains not seen during training. We also measure the contributions of both image and textual input and show that while text-only models always outperform image-only models, only the multimodal sequential model combining both image and text improves over the text-only model on all experimental datasets.

【 授权许可】

Unknown   

  文献评价指标  
  下载次数:0次 浏览次数:0次