The Journal of Engineering | |
Uniform attribute-content model | |
  1    1    1    1  | |
[1] National Key Laboratory of Science and Technology on Blind Signal Processing, Chengdu, People's Republic of China; | |
关键词: feature extraction; information retrieval; text analysis; Monte Carlo methods; uniform attribute-content model; text processing; text modelling methods; content feature extraction process; content information; attribute information; Monte Carlo method; | |
DOI : 10.1049/joe.2018.5135 | |
来源: publisher | |
【 摘 要 】
There have been growing needs for text processing, such as classifying, retrieving and clustering. The foundation of such a process is to extract features, which can best describe the text. Great progress has been made in text modelling. However, most of the text modelling methods are based only on the content, nor only on the attributes. Although there have been some combined models proposed in recent years, the lack of universality limits such models. In this study, the authors propose a uniform attribute-content model, which uses the attributes to influence the content feature extraction process. They design the attributes as a special filter to each feature extracted from the content. Thus the mixed features contain both content information and attribute information, which can describe the text more precise. They also propose a Monte Carlo method to solve this model. Experimental results on the Enron email dataset demonstrate the effectiveness of the authors’ proposed models.
【 授权许可】
CC BY
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
RO201911185539057ZK.pdf | 1271KB | download |