Multi-keyword search over encrypted data with scoring and search pattern obfuscation

Örencik, Cengiz and Selçuk, Ayşe and Savaş, Erkay and Kantarcıoğlu, Murat (2015) Multi-keyword search over encrypted data with scoring and search pattern obfuscation. International Journal of Information Security. ISSN 1615-5262 (Print) 1615-5270 (Online). Published Online First. http://dx.doi.org/10.1007/s10207-015-0294-9

Warning
There is a more recent version of this item available.

Abstract

Search over encrypted data has recently become a critical operation that has attracted considerable interest in both academia and industry, especially as outsourcing sensitive data to the cloud proves to be a strong trend for benefiting from its unmatched storage and computing capacities. Indeed, privacy-preserving search over encrypted data, an apt term for addressing the privacy-related issues concomitant with outsourcing sensitive data, has been widely investigated in the literature under different models and assumptions. In this work, we propose an efficient scheme that allows privacy-preserving search over encrypted data using queries with multiple keywords. The most important contributions of this work are as follows. Firstly, using a property referred to as δ-mean query obfuscation, the proposed scheme hides the search patterns, which are allowed to leak in many works in the literature, including our preliminary work on the subject, Orencik et al. (2013) [1]. Secondly, a two-server setting is employed to eliminate the correlation between the queries and the matching documents sent to the user, under the assumption that the two servers do not collude. Thirdly, we propose a novel compression scheme that reduces both the communication cost between the two servers and the computation cost of the search operation by a factor of more than 55 compared to the standard approach. Finally, the proposed scheme also provides an effective scoring and ranking capability based on the term frequency–inverse document frequency (tf-idf) weights of keyword–document pairs. Our analyses demonstrate that the proposed scheme is privacy-preserving, efficient and effective.
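
For readers unfamiliar with tf-idf, the following is a minimal plaintext sketch of how a multi-keyword query can be scored and ranked by summing tf-idf weights. It is a generic illustration only, not the scheme proposed in the article: it involves no encryption, obfuscation or two-server setting, and the function name `tf_idf_scores`, the sample documents and the particular tf and idf formulas (relative term frequency and the natural log of N/df) are assumptions chosen for illustration.

```python
import math
from collections import Counter


def tf_idf_scores(docs, query):
    """Rank documents for a multi-keyword query by summed tf-idf weight.

    docs:  dict mapping a document id to its list of plaintext keywords.
    query: list of query keywords.
    Returns a list of (doc_id, score) pairs, highest score first.
    Illustrative only; the article's scheme operates on encrypted data.
    """
    n_docs = len(docs)
    term_counts = {doc_id: Counter(words) for doc_id, words in docs.items()}
    keywords = set(query)
    # Document frequency: number of documents containing each query keyword.
    doc_freq = {
        kw: sum(1 for counts in term_counts.values() if kw in counts)
        for kw in keywords
    }
    scores = {}
    for doc_id, counts in term_counts.items():
        total = sum(counts.values())
        score = 0.0
        for kw in keywords:
            if total == 0 or doc_freq[kw] == 0:
                continue  # empty document or keyword absent from the collection
            tf = counts[kw] / total                # relative term frequency
            idf = math.log(n_docs / doc_freq[kw])  # rarer keywords weigh more
            score += tf * idf
        scores[doc_id] = score
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical sample data, for illustration only.
sample_docs = {
    "d1": ["cloud", "privacy", "cloud"],
    "d2": ["search", "privacy"],
    "d3": ["ranking", "search"],
}
ranking = tf_idf_scores(sample_docs, ["cloud", "search"])
```

In this sketch a document that contains a rare query keyword several times ranks above documents that contain only a common one, which is the intuition behind using tf-idf weights of keyword–document pairs for ranking.
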
Item Type: Article
Uncontrolled Keywords: Encrypted cloud data; Secure search; Privacy preservation; Efficiency; Scoring
Subjects: Q Science > QA Mathematics > QA075 Electronic computers. Computer science
Q Science > QA Mathematics > QA076 Computer software
Divisions: Faculty of Engineering and Natural Sciences > Academic programs > Computer Science & Eng.
Faculty of Engineering and Natural Sciences
Depositing User: Erkay Savaş
Date Deposited: 22 Dec 2015 14:47
Last Modified: 26 Apr 2022 09:27
URI: https://research.sabanciuniv.edu/id/eprint/28296
