Journal article

【Abstract】

There is a growing need for text processing tasks such as classification, retrieval and clustering. The foundation of these tasks is the extraction of features that best describe the text. Great progress has been made in text modelling. However, most text modelling methods are based either only on the content or only on the attributes. Although some combined models have been proposed in recent years, their lack of universality limits their use. In this study, the authors propose a uniform attribute-content model, which uses the attributes to influence the content feature extraction process. They design the attributes as a special filter applied to each feature extracted from the content. The resulting mixed features thus contain both content information and attribute information and can describe the text more precisely. They also propose a Monte Carlo method to solve this model. Experimental results on the Enron email dataset demonstrate the effectiveness of the authors’ proposed models.

【License】

CC BY

【Preview】

Attachments
Files	Size	Format	View
RO201911185539057ZK.pdf	1271KB	PDF	download

The Journal of Engineering
Uniform attribute-content model

[1] National Key Laboratory of Science and Technology on Blind Signal Processing, Chengdu, People's Republic of China;
Keywords: feature extraction; information retrieval; text analysis; Monte Carlo methods; uniform attribute-content model; text processing; text modelling methods; content feature extraction process; content information; attribute information; Monte Carlo method;
DOI : 10.1049/joe.2018.5135
Source: publisher
