Privacy in Big Data analytics is one of the most important issues that analysts and businesses face when managing personal data. In a privacy preserving analysis process, the privacy risk on the individuals represented in the data is firstly evaluated, then the data is appropriately modified in order to preserve privacy while at the same time maintaining a certain level of data quality. In this thesis we focus on privacy risk assessment, proposing new models and algorithms to deal with this fundamental part of privacy aware systems. We propose some extensions to an existing state-of-the-art privacy risk assessment framework, to improve on existing literature. Then, we propose a classification based methodology to predict privacy risk. We validate our proposal on three different types of real world data: human mobility, retail and social network data. Finally we propose a new model for the behavior of an adversary in human mobility data, leveraging the natural structure and constraints of this kind of data.

Modeling & Predicting Privacy Risk in Personal Data

2020

Abstract

Privacy in Big Data analytics is one of the most important issues that analysts and businesses face when managing personal data. In a privacy preserving analysis process, the privacy risk on the individuals represented in the data is firstly evaluated, then the data is appropriately modified in order to preserve privacy while at the same time maintaining a certain level of data quality. In this thesis we focus on privacy risk assessment, proposing new models and algorithms to deal with this fundamental part of privacy aware systems. We propose some extensions to an existing state-of-the-art privacy risk assessment framework, to improve on existing literature. Then, we propose a classification based methodology to predict privacy risk. We validate our proposal on three different types of real world data: human mobility, retail and social network data. Finally we propose a new model for the behavior of an adversary in human mobility data, leveraging the natural structure and constraints of this kind of data.
27-feb-2020
Italiano
Monreale, Anna
Pedreschi, Dino
Università degli Studi di Pisa
File in questo prodotto:
File Dimensione Formato  
Relazione.pdf

Open Access dal 05/03/2023

Tipologia: Altro materiale allegato
Dimensione 131.09 kB
Formato Adobe PDF
131.09 kB Adobe PDF Visualizza/Apri
TESI_DI_DOTTORATO__reviewed__2.pdf

Open Access dal 05/03/2023

Tipologia: Altro materiale allegato
Dimensione 20.03 MB
Formato Adobe PDF
20.03 MB Adobe PDF Visualizza/Apri

I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14242/151226
Il codice NBN di questa tesi è URN:NBN:IT:UNIPI-151226