科技报告详细信息
Collaborative Filtering on Skewed Datasets
Banerjee, Somnath ; Ramanathan, Krishnan
HP Development Company
关键词: collaborative filtering;    skewed dataset;    pLSA;   
RP-ID  :  HPL-2008-50
学科分类:计算机科学(综合)
美国|英语
来源: HP Labs
PDF
【 摘 要 】

Many real life datasets have skewed distributions of events when the probability of observing few events far exceeds the others. In this paper, we observed that in skewed datasets the state of the art collaborative filtering methods perform worse than a simple probabilistic model. Our test bench includes a real ad click stream dataset which is naturally skewed. The same conclusion is obtained even from the popular movie rating dataset when we pose a binary prediction problem of whether a user will give maximum rating to a movie or not. Publication Info: Presented and published in Proceedings of WWW 2008, Beijing, China 2 Pages

【 预 览 】
附件列表
Files Size Format View
RO201804100002401LZ 83KB PDF download
  文献评价指标  
  下载次数:7次 浏览次数:14次