Journal Article Details
Entropy
Selection of the Optimal Number of Topics for LDA Topic Model—Taking Patent Policy Analysis as an Example
Jingxian Gan [1], Yong Qi [1]
[1] School of Intellectual Property, Nanjing University of Science and Technology, Nanjing 210094, China
Keywords: LDA; topic model; optimal number of topics; patent policy
DOI: 10.3390/e23101301
Source: DOAJ
【Abstract】

This study constructs a comprehensive index for judging the optimal number of topics in an LDA topic model. Based on the requirements for topic-number selection, the index combines four measures: perplexity, isolation, stability, and coincidence. A topic number selected with this index offers four advantages: (1) good predictive ability, (2) high isolation between topics, (3) no duplicate topics, and (4) repeatability. First, we compare the proposed method with existing methods on three general datasets, and the results show that the proposed method yields better topic-number selections. Then, we collect the patent policies of provinces and cities in China (excluding Hong Kong, Macao, and Taiwan) as a dataset. Using the proposed topic-number selection method, we can classify these patent policies well.
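
Of the four measures named in the abstract, perplexity has a standard definition: the exponential of the negative average per-token log-likelihood. The abstract does not give the formulas for isolation, stability, or coincidence, nor how the four are combined, so the sketch below covers perplexity only. The function name `perplexity` and the inputs `docs`, `doc_topic`, and `topic_word` are hypothetical, chosen for illustration, and are not taken from the paper.

```python
import math


def perplexity(docs, doc_topic, topic_word):
    """Standard perplexity of a fitted topic model on a corpus.

    docs:       list of documents, each a list of integer word ids
    doc_topic:  doc_topic[d][k] = p(topic k | document d)
    topic_word: topic_word[k][w] = p(word w | topic k)
    (All three names are illustrative assumptions.)
    """
    n_topics = len(topic_word)
    log_likelihood = 0.0
    n_tokens = 0
    for d, words in enumerate(docs):
        for w in words:
            # Marginal probability of the word under the document's topic mixture.
            p_w = sum(doc_topic[d][k] * topic_word[k][w] for k in range(n_topics))
            log_likelihood += math.log(p_w)
            n_tokens += 1
    # Lower perplexity means better predictive ability.
    return math.exp(-log_likelihood / n_tokens)
```

A sanity check: a model that assigns every word in a V-word vocabulary the same probability has perplexity V. In practice one would compute this value for each candidate number of topics and weigh it alongside the other three measures.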

【License】

Unknown   
