IEEE ICDM Workshop on Frequent Itemset Mining Implementations | |
DCI Closed: a Fast and Memory Efficient Algorithm to Mine Frequent Closed Itemsets | |
Claudio Lucchese ; Salvatore Orlando ; Raffaele Perego | |
Others : http://CEUR-WS.org/Vol-126/lucchese.pdf PID : 1780 |
|
来源: CEUR | |
【 摘 要 】
One of the main problems raising up in the frequent closed itemsets mining problem is the duplicate detection. In this paper we propose a general technique for promptly detecting and discarding duplicate closed itemsets, without the need of keeping in the main memory the whole set of closed patterns. Our approach can be exploited with substantial performance benefits by any algorithm that adopts a vertical representation of the dataset. We implemented our technique within a new depth-first closed itemsets mining algorithm. The experimental evaluation demonstrates that our algorithm outperforms other state of the art algorithms like CLOSET+ and FPCLOSE.
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
DCI Closed: a Fast and Memory Efficient Algorithm to Mine Frequent Closed Itemsets | 243KB | download |