Dissertation Details
Similarity learning in the era of big data
Chang, Shiyu
Keywords: Similarity Learning; Big Data; Large Volume Data; Multimodality Data; High-velocity Data; Large-scale; Supervised Similarity Learning; Network Embedding; Deep Embedding; Heterogeneous Network; Streaming Network; Positive-Unlabeled Learning; Link Prediction; Recommendation; Social Media; Search and Retrieval
Others: https://www.ideals.illinois.edu/bitstream/handle/2142/95303/CHANG-DISSERTATION-2016.pdf?sequence=1&isAllowed=y
United States | English
Source: The Illinois Digital Environment for Access to Learning and Scholarship
【 Abstract 】

This dissertation studies the problem of similarity learning in the era of big data, with heavy emphasis on real-world applications in social media. As in the saying “birds of a feather flock together,” similarity learning aims to identify the notion of being similar in a data-driven and task-specific way, which is a central problem for maximizing the value of big data. Despite the many successes of similarity learning over the past decades, social media networks, as one of the most typical big data media, contain large-volume, varied, and high-velocity data, which makes conventional learning paradigms and off-the-shelf algorithms insufficient. Thus, we focus on addressing the emerging challenges brought by the inherent “three Vs” characteristics of big data by answering the following questions: 1) Similarity is characterized by both links and node contents in networks; how can we identify the contribution of each network component in order to seamlessly construct an application-oriented similarity function? 2) Social media data are massive and contain much noise; how can we efficiently learn the similarity between node pairs in large and noisy environments? 3) Node contents in social media networks are multi-modal; how can we effectively measure cross-modal similarity by bridging the so-called “semantic gap”? 4) User wants and needs, and item characteristics, are continuously evolving, which generates data at an unprecedented rate; how can we model temporal dynamics in a principled way and support timely decision making? The goal of this dissertation is to provide solutions to these questions via innovative research and novel methods. We hope this dissertation sheds more light on similarity learning in the big data era and broadens its applications in social media.

【 Preview 】
Attachments
Files | Size | Format
Similarity learning in the era of big data | 3605 KB | PDF