Please use this identifier to cite or link to this item: http://hdl.handle.net/10609/93052
Title: Privacy-preserving data outsourcing in the cloud via semantic data splitting
Author: Sánchez Ruenes, David
Batet, Montserrat  
Others: Universitat Rovira i Virgili (URV)
Universitat Oberta de Catalunya. Internet Interdisciplinary Institute (IN3)
Citation: Sánchez, D. & Batet, M. (2017). Privacy-preserving data outsourcing in the cloud via semantic data splitting. Computer Communications, 110(), 187-201. doi: 10.1016/j.comcom.2017.06.012
Abstract: Even though cloud computing provides many intrinsic benefits (e.g., cost savings, availability, scalability, etc.), privacy concerns related to the lack of control over the storage and management of the outsourced (confidential) data still prevent many customers from migrating to the cloud. In this respect, several privacy-protection mechanisms based on a prior encryption of the data to be outsourced have been proposed. Data encryption offers robust security, but at the cost of hampering the efficiency of the service and limiting the functionalities that can be applied over the (encrypted) data stored on cloud premises. Because both efficiency and functionality are crucial advantages of cloud computing, especially in SaaS, in this paper we aim at retaining them by proposing a privacy-protection mechanism that relies on splitting (clear) data, and on the distributed storage offered by the increasingly popular notion of multi-clouds. Specifically, we propose a semantically-grounded data splitting mechanism that is able to automatically detect pieces of data that may cause privacy risks and split them on local premises, so that each chunk does not incur in those risks; then, chunks of clear data are independently stored into the separate locations of a multi-cloud, so that external entities (cloud service providers and attackers) cannot have access to the whole confidential data. Because partial data are stored in clear on cloud premises, outsourced functionalities are seamlessly and efficiently supported by just broadcasting queries to the different cloud locations. To enforce a robust privacy notion, our proposal relies on a privacy model that offers a priori privacy guarantees; to ensure its feasibility, we have designed heuristic algorithms that minimize the number of cloud storage locations we need; to show its potential and generality, we have applied it to the least structured and most challenging data type: plain textual documents.
Keywords: data outsourcing
multi-cloud
privacy protection
data splitting
semantics
DOI: 10.1016/j.comcom.2017.06.012
Document type: info:eu-repo/semantics/article
Version: info:eu-repo/semantics/submittedVersion
Issue Date: Apr-2017
Publication license: http://creativecommons.org/licenses/by-nc-nd/3.0/es/  
Appears in Collections:Articles cientÍfics
Articles

Files in This Item:
File Description SizeFormat 
privacypreserving.pdfPreprint416,18 kBAdobe PDFThumbnail
View/Open
Share:
Export:
View statistics

Items in repository are protected by copyright, with all rights reserved, unless otherwise indicated.