Journal Article Details
IEEE Access
Novelty Detection in Social Media by Fusing Text and Image Into a Single Structure
Evandro O. T. Salles [1], Marta Amorim [1], Patrick M. Ciarelli [2], Frederico D. Bortoloti [3], Daniel C. Cavalieri [3]
[1] Electrical Engineering Department, Federal University of Espírito Santo, Vitória, Brazil
Keywords: Detection algorithms; knowledge based systems; learning systems; machine learning; neural networks; classification algorithms
DOI: 10.1109/ACCESS.2019.2939736
Source: DOAJ
【 Abstract 】

This work proposes an approach for detecting novelties that takes into account the temporal flow of data streams in social media. To this end, we present a new architecture for novelty detection that makes three contributions. First, we propose a new definition of novelty based on temporal windows. Second, we formulate an expression to determine the quality of a novelty. Third, we introduce a new approach to the fusion of heterogeneous data (image + text), using the COCO dataset and the Mask R-CNN convolutional neural network, which transforms image and text from social media into a single data format that machine learning algorithms can process. Since novelty detection is a task in which labeled samples are scarce or nonexistent, unsupervised algorithms are used; the following baseline and state-of-the-art algorithms were chosen: kNN, HBOS, FBagging, IForesting, and autoencoders. The new fusion approach is also compared to AOM, a state-of-the-art approach to outlier detection. Because of the temporal particularities and the data types being fused, a new dataset was created, containing 27,494 tweets collected from Twitter. Our experiments show that classifying social media data using data fusion is superior to using only text or only images as input.

【 License 】

Unknown   
