Tesi etd-09262022-162111

Tipo di tesi

Tesi di laurea magistrale

Autore

ARTIOLI, ANDREA

URN

etd-09262022-162111

Titolo

Verso la generalizzazione degli attacchi di re-identification basati su hint

Titolo in inglese

Towards generalization of hints-based re-identification attack

Struttura

Dipartimento di Scienze Fisiche, Informatiche e Matematiche

Corso di studi

Informatica (D.M.270/04)

Commissione

Nome Commissario	Qualifica
LEONCINI MAURO	Primo relatore
BEDOGNI LUCA	Secondo relatore

Parole chiave

Dataset Enrichment
Distributed MCTA
Privacy
Re-identification
Security

Data inizio appello

2022-10-28

Disponibilità

Accessibile via web (tutti i file della tesi sono accessibili)

Riassunto analitico

In questo lavoro è presentato un nuovo framework per attacchi di
reidentificazione basato su ragionamenti deduttivi e l'arricchimento del
dataset. L'idea di base è quella di trasformare i dati in una
rappresentazione standard basata su grafo. Dopodichè, tramite l'aggiunta di
"hint", il dataset viene arricchito di nuove informazioni ricavate
dall'attaccante. Queste possono essere ottenute nei modi più disperati:
dalla memoria dell'individuo, all'accesso di videocamere o dal social media
scraping. In seguito, l'attaccante stesso definisce anche una serie di
"labels" che permettono al modello di capire quali informazioni si vogliono
dedurre dall'attacco. Dopodichè, sarà il modello ad applicare ragionamenti
deduttivi che lo porteranno a svelare queste informazioni target e quindi a
deanonimizzare il dataset. Sviluppare un framework il più generale
possibile è l'obiettivo principale di questo lavoro. Questa nuova
metodologia è stata testata su un test case che simula un'applicazione di
Mobile Contact Tracing (MCTA), migliorando sensibilmente lo stato dell'arte
sia per le deanonimizzazioni, sia per le performance. Questo è dovuto a una
migliore generalizzazione dei ragionamenti deduttivi considerati, scaturiti
dall'analisi di un modello molto più ad alto livello del precedente.
Tenendo conto che l'obiettivo ultimo dello studio degli attacchi alla
privacy è il miglioramento delle difese esistenti, lo studio delle istanze
dove l'attacco è fallito ha portato alla definizione delle "secure
structures", ovvero di grafi con determinate proprietà che sistematicamente
non permettono la deduzione di informazioni sugli utenti coinvolti nel
dataset.

Abstract

In this work, a novel model of re-identification framework based on deductive reasoning and dataset enrichment has been presented. The basic idea is to transform data into a standard graph-based representation. Then, by adding "hints", the dataset is enriched with new information obtained by the attacker. These can be obtained in the most disparate ways: from the memory of the individual, to the access of video cameras or from social media scraping. Later, the attacker also defines a series of "labels" that allow the model to understand what information is to be deduced from the attack. After this, the model will apply deductive reasoning that will lead it to reveal this target information and then to deanonymize the dataset. Developing a framework as general as possible is the main objective of this work. This new methodology has been compared on a test case that simulates a Mobile Contact Tracing Applications (MCTA), significantly improving the state of the art for both deanonymizations and performance. This is due to a better generalization of the deductive reasonings considered, resulting from the analysis of a much higher level model than the previous one. Considering that the ultimate goal of the study of privacy attacks is the improvement of existing defenses, the study of instances where the attack failed led to the definition of "secure structures", which are graphs with certain properties that do not systematically allow the deduction of information about the users involved in the dataset.

File

Nome file		Dimensione	Tempo di download stimato (Ore:Minuti:Secondi)
Nome file		Dimensione	28.8 Modem	56K Modem	ISDN (64 Kb)	ISDN (128 Kb)	piu' di 128 Kb
	Tesi.pdf	899.83 Kb	00:04:09	00:02:08	00:01:52	00:00:56	00:00:04
Contatta l'autore