Entropy
On Using Linear Diophantine Equations for in-Parallel Hiding of Decision Tree Rules
Georgios Feretzakis [1], Vassilios S. Verykios [1], Dimitris Kalles [1]
[1] School of Science and Technology, Hellenic Open University, Patras 263 35, Greece
Keywords: decision trees; privacy preserving; Diophantine equations; hiding rules; entropy; information gain; data sharing
DOI: 10.3390/e21010066
Source: DOAJ
【Abstract】
Data sharing among organizations has become increasingly common in areas such as advertising, marketing, electronic commerce, banking, and insurance. However, an organization that shares its datasets with others will most likely want to keep some patterns hidden. This paper focuses on preserving the privacy of sensitive patterns when inducing decision trees. We adopt a record augmentation approach to hide critical classification rules in binary datasets. Such a hiding methodology is preferred over other heuristic solutions, such as output perturbation or cryptographic techniques, which limit the usability of the data, because it leaves the raw data itself readily available for public use. We propose a look-ahead technique that uses linear Diophantine equations to add the appropriate number of instances while maintaining the initial entropy of the nodes. This method can be used to hide one or more decision tree rules optimally.
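To illustrate how adding instances can leave a node's entropy unchanged, here is a minimal sketch. It is not taken from the paper. It assumes a binary node with `pos` positive and `neg` negative instances and relies on one simple sufficient condition: if the added counts `(x, y)` preserve the class ratio, then `neg*x - pos*y = 0`, which is a homogeneous linear Diophantine equation whose positive solutions are multiples of `(pos/g, neg/g)` with `g = gcd(pos, neg)`. The function names and the parameter `k` are hypothetical, and the paper's look-ahead procedure may set up its equations differently.

```python
from math import gcd, log2


def entropy(pos, neg):
    """Shannon entropy (in bits) of a binary node with the given class counts."""
    total = pos + neg
    h = 0.0
    for count in (pos, neg):
        if count:  # skip empty classes, since 0 * log2(0) is taken as 0
            p = count / total
            h -= p * log2(p)
    return h


def ratio_preserving_additions(pos, neg, k=1):
    """Return the k-th positive solution (x, y) of neg*x - pos*y = 0.

    Hypothetical helper: adding x positive and y negative instances keeps
    the class ratio pos:neg, and therefore the node's entropy, unchanged.
    """
    if pos < 0 or neg < 0 or pos + neg == 0 or k < 1:
        raise ValueError("need non-negative counts, a non-empty node, and k >= 1")
    g = gcd(pos, neg)
    return k * pos // g, k * neg // g


# A node with 6 positive and 4 negative instances (illustrative numbers).
x, y = ratio_preserving_additions(6, 4)
before = entropy(6, 4)
after = entropy(6 + x, 4 + y)
```

Here `(x, y)` is the smallest non-trivial addition that keeps the 6:4 ratio, so `before` and `after` agree up to floating-point rounding; larger values of `k` give the other ratio-preserving additions.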
【License】
Unknown