Abstract: Frequent itemset mining is a data mining technique widely used on massive datasets. In cloud computing, the dataset may be encrypted for privacy protection. Therefore, frequent itemset mining over encrypted data is a crucial application in secure cloud computing. In this paper, we propose an effective privacy-preserving framework where the cloud server can directly perform data mining on the encrypted database without interacting with other cloud servers. We first design three security primitives to implement subset determination, accumulation, and comparison in the encrypted domain for frequent itemset mining. Based on the proposed framework, we then propose two secure protocols that allow the cloud server to perform frequent itemset mining on encrypted cloud data with these security primitives. The first protocol leaks no information to the cloud and the second protocol has the advantage of more efficient mining performance. We then present two strategies with parallel algorithms and GPU computing to accelerate the running time. We also analyze the security of our protocols and the computational complexities. Experimental results show that our serial-based protocols achieve shorter running times and higher levels of privacy than previous solutions. Our multi-CPU (or GPU) based parallel protocol can further reduce the practical running time.
Loading