Opening the Black Box: Empowering Machine Learning Models with Explanations

Setzu, Mattia

The thesis tackles two problems in the recently-born field of Explainable AI (XAI), and proposes some algorithms to solve them. XAI has the overarching goal of providing human-understandable explanations of Machine Learning models, which, nowadays, operate as highly complex black-box models whose decisions, especially in high-stakes and critical settings, we are not able to understand. The thesis tackles the novel problem of Local-to-Global (L2G) explainability, and local explainability. In a L2G setting one wishes to infer an understanding of the overall behavior of a model starting from explanations of its punctual decisions, that is, to infer global explanations from local ones. We propose two Local-to-Global algorithms to tackle this problem, Rule Relevance Score and GLocalX. Then, we focus on local explainability, and provide an algorithm, TriplEx, to explain Transformer-based models on a variety of tasks.

Opening the Black Box: Empowering Machine Learning Models with Explanations

SETZU, MATTIA

2022

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

	Data di pubblicazione
	
				22-mag-2022
			
	Lingua
	
				Italiano
			
	Parola chiave
	
				explainability
			
	Relatore, Supervisor, Advisor o Tutor
	
				Monreale, Anna
			
	Correlatore, Controrelatore, Co-Supervisor,  Co-Tutor o Coordinatori
	
				Pedreschi, Dino
			
	Collezione di appartenenza
	
				Università degli Studi di Pisa

File in questo prodotto:

File	Dimensione	Formato
thesis.pdf accesso aperto Dimensione 4.09 MB Formato Adobe PDF Visualizza/Apri	4.09 MB	Adobe PDF	Visualizza/Apri
thesis_synthesis_and_phd_activity.pdf accesso aperto Dimensione 222.73 kB Formato Adobe PDF Visualizza/Apri	222.73 kB	Adobe PDF	Visualizza/Apri

I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14242/215812

Il codice NBN di questa tesi è URN:NBN:IT:UNIPI-215812