科技报告详细信息
Unlabeled Data Can Degrade Classification Performance of Generative Classifiers
Cozman, Fabio G. ; Cohen, Ira
HP Development Company
关键词: semi-supervised learning;    labeled and unlabeled data problem;    classification;    maximum-likelihood estimation;    EM algorithm;   
RP-ID  :  HPL-2001-234
学科分类:计算机科学(综合)
美国|英语
来源: HP Labs
PDF
【 摘 要 】

This report analyzes the effect of unlabeled training data in generative classifiers. We are interested in classification performance when unlabeled data are added to an existing pool of labeled data. We show that there are situations where unlabeled data can degrade the performance of a classifier. We present an analysis of these situations and explain several seemingly disparate results in the literature. 16 Pages

【 预 览 】
附件列表
Files Size Format View
RO201804100002583LZ 273KB PDF download
  文献评价指标  
  下载次数:15次 浏览次数:62次