Privacy Preserving Clustering On Horizontally Partitioned Data
İnan, Ali and Kaya, Selim Volkan and Saygın, Yücel and Savaş, Erkay and Hintoğlu, Ayça Azgın and Levi, Albert (2005) Privacy Preserving Clustering On Horizontally Partitioned Data. (Submitted)
Official URL: http://ieeexplore.ieee.org/iel5/10810/34089/01623890.pdf
Data mining has been a popular research area for more than a decade due to its vast spectrum of applications. However, the popularity and wide availability of data mining tools have also raised concerns about the privacy of individuals. The aim of privacy preserving data mining researchers is to develop data mining techniques that can be applied to databases without violating the privacy of individuals. Privacy preserving techniques for various data mining models have been proposed, initially for classification on centralized data and then for association rules in distributed environments. In this work, we propose methods for constructing the dissimilarity matrix of objects from different sites in a privacy preserving manner. The resulting matrix can be used for privacy preserving clustering as well as database joins, record linkage and other operations that require pair-wise comparison of individual private data objects horizontally distributed across multiple sites. We evaluate the communication and computation complexity of our protocol through experiments on synthetically generated and real datasets. Each experiment is also performed with a baseline protocol that has no privacy protection, so that comparing the two protocols shows the overhead introduced by security and privacy.
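To make the notion of a dissimilarity matrix over horizontally partitioned data concrete, the following is a minimal sketch of what a non-private baseline might compute. It is not the protocol proposed in the paper: it simply pools the raw records of all sites, which is exactly what a privacy preserving protocol must avoid. The function names, the sample records and the choice of Euclidean distance are assumptions made for illustration only.

```python
import math


def dissimilarity_matrix(objects):
    """Return the symmetric matrix of pairwise Euclidean distances.

    Euclidean distance is an assumption; the abstract does not fix a metric.
    """
    n = len(objects)
    matrix = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            d = math.dist(objects[i], objects[j])
            matrix[i][j] = d
            matrix[j][i] = d
    return matrix


def baseline_global_matrix(site_partitions):
    """Hypothetical non-private baseline: pool every site's rows, then compare.

    Horizontal partitioning means each site holds different objects
    described by the same attributes, so the rows can be concatenated.
    """
    pooled = [row for site in site_partitions for row in site]
    return dissimilarity_matrix(pooled)


# Two sites holding records with the same two numeric attributes.
site_a = [(0.0, 0.0), (3.0, 4.0)]
site_b = [(6.0, 8.0)]
global_matrix = baseline_global_matrix([site_a, site_b])
```

The entries that compare objects held by different sites (here, rows 0 and 1 against row 2) are the ones that require a privacy preserving protocol, since neither site should see the other's raw attribute values.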