Multi-keyword search over encrypted data with scoring and search pattern obfuscation

Örencik, Cengiz and Selçuk, Ayşe and Savaş, Erkay and Kantarcıoğlu, Murat Multi-keyword search over encrypted data with scoring and search pattern obfuscation. International Journal of Information Security . ISSN 1615-5262 (Print) 1615-5270 (Online) Published Online First http://dx.doi.org/10.1007/s10207-015-0294-9

WarningThere is a more recent version of this item available.

[img]PDF - Registered users only - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader

Official URL: http://dx.doi.org/10.1007/s10207-015-0294-9


Search over encrypted data recently became a critical operation that raised a considerable amount of interest in both academia and industry. Especially, as outsourcing, sensitive data to cloud prove to be a strong trend to benefit from the unmatched storage and computing capacities thereof. Indeed, privacy-preserving search over encrypted data, an apt term to address privacy-related issues concomitant in outsourcing sensitive data, have been widely investigated in the literature under different models and assumptions. In this work, we propose an efficient scheme that allows privacy-preserving search over encrypted data using queries with multiple keywords. Most important contributions of this work are as follows. Firstly, using a property referred as δ-mean query obfuscation, the proposed scheme hides the search patterns, which are allowed to leak in many works in the literature including our preliminary work on the subject Orencik et al. (2013) [1]. Secondly, a two-server setting is employed to eliminate the correlation between the queries and matching documents sent to the user under the assumption that the two servers are not colluding. Thirdly, we propose a novel compression scheme that reduces both the communication cost between the two servers and the computation cost of the search operation more than 55 times compared to the standard approach. And finally, the proposed scheme also provides an effective scoring and ranking capability that is based on term frequency–inverse document frequency (tf-idf) weights of keyword–document pairs. Our analyses demonstrate that the proposed scheme is privacy-preserving, efficient and effective.

Item Type:Article
Uncontrolled Keywords:Encrypted cloud data; Secure search; Privacy preservation; Efficiency; Scoring
Subjects:Q Science > QA Mathematics > QA075 Electronic computers. Computer science
Q Science > QA Mathematics > QA076 Computer software
ID Code:28296
Deposited By:Erkay Savaş
Deposited On:22 Dec 2015 14:47
Last Modified:11 Nov 2016 22:42

Available Versions of this Item

Repository Staff Only: item control page