会议论文详细信息
The 2003 SIAM(Society for Industrial and Applied Mathematics) International Conference on Data Mining
Mixture models and frequent sets: combining global and local methods for 0 - 1 data
Jaakko Hollmén ; Jouni K. Seppänen ; Heikki Mannila
Others  :  http://www.siam.org/proceedings/datamining/2003/dm03_32HollmenJ.pdf
PID  :  19007
来源: CEUR
PDF
【 摘 要 】

We study the interaction between global and local techniques in data mining. Specifically, we study the collections of frequent sets in clusters produced by a probabilistic clustering using mixtures of Bernoulli models. That is, we first analyze 0 - 1 datasets by a global technique (probabilistic clustering using the EM algorithm) and then do a local analysis (discovery of frequent sets) in each of the clusters. The results indicate that the use of clustering as a preliminary phase in finding frequent sets produces clusters that have significantly different collections of frequent sets. We also test the significance of the differences in the frequent set collections in the different clusters by obtaining estimates of the underlying joint density. To get from the local patterns in each cluster back to distributions, we use the maximum entropy technique to obtain a local model for each cluster, and then combine these local models to get a mixture model. We obtain clear improvements to the approximation quality against the use of either the mixture model or the maximum entropy model.

【 预 览 】
附件列表
Files Size Format View
Mixture models and frequent sets: combining global and local methods for 0 - 1 data 323KB PDF download
  文献评价指标  
  下载次数:8次 浏览次数:3次