期刊论文详细信息
International Journal of Computational Intelligence Systems
Text Categorization Based on Topic Model
关键词: Topic model;    Latent Dirichlet allocation;    Variational Inference;    Category LanguageModel.;   
DOI  :  10.2991/ijcis.2009.2.4.8
来源: DOAJ
【 摘 要 】

In the text literature, many topic models were proposed to represent documents and words as topics or latent topics in order to process text effectively and accurately. In this paper, we propose LDACLM or Latent Dirichlet Allocation Category LanguageModel for text categorization and estimate parameters of models by variational inference. As a variant of Latent Dirichlet Allocation Model, LDACLM regards documents of category as Language Model and uses variational parameters to estimate maximum a posteriori of terms. In general, experiments show LDACLM model is effective and outperform Na¨?ve Bayes with Laplace smoothing and Rocchio algorithm but little inferior to SVM for text categorization.

【 授权许可】

Unknown   

  文献评价指标  
  下载次数:0次 浏览次数:1次