Dissertation

【Abstract】

This dissertation studies the problem of similarity learning in the era of big data, with heavy emphasis on real-world applications in social media. As in the saying “birds of a feather flock together,” in similarity learning we aim to identify the notion of being similar in a data-driven and task-specific way, which is a central problem for maximizing the value of big data. Despite the many successes of similarity learning over the past decades, social media networks, as one of the most typical big data media, contain large-volume, varied, and high-velocity data, which makes conventional learning paradigms and off-the-shelf algorithms insufficient. Thus, we focus on addressing the emerging challenges brought by the inherent “three Vs” characteristics of big data by answering the following questions: 1) Similarity is characterized by both links and node contents in networks; how can we identify the contribution of each network component to seamlessly construct an application-oriented similarity function? 2) Social media data are massive and contain much noise; how can we efficiently learn the similarity between node pairs in large and noisy environments? 3) Node contents in social media networks are multi-modal; how can we effectively measure cross-modal similarity by bridging the so-called “semantic gap”? 4) User wants and needs, and item characteristics, are continuously evolving, which generates data at an unprecedented rate; how can we model these temporal dynamics in a principled way and support timely decision making? The goal of this dissertation is to provide solutions to these questions via innovative research and novel methods. We hope this dissertation sheds more light on similarity learning in the big data era and broadens its applications in social media.

【Preview】

Attachments
Files	Size	Format	View
Similarity learning in the era of big data	3605KB	PDF	download


Similarity learning in the era of big data
Keywords: Similarity Learning; Big Data; Large Volume Data; Multimodality Data; High-velocity Data; Large-scale; Supervised Similarity Learning; Network Embedding; Deep Embedding; Heterogeneous Network; Streaming Network; Positive-Unlabeled Learning; Link Prediction; Recommendation; Social Media; Search and Retrieval
Chang, Shiyu
Others : https://www.ideals.illinois.edu/bitstream/handle/2142/95303/CHANG-DISSERTATION-2016.pdf?sequence=1&isAllowed=y
United States | English
Source: The Illinois Digital Environment for Access to Learning and Scholarship


	Document metrics
	Downloads: 23	Views: 14
