The thesis tackles two problems in the recently-born field of Explainable AI (XAI), and proposes some algorithms to solve them. XAI has the overarching goal of providing human-understandable explanations of Machine Learning models, which, nowadays, operate as highly complex black-box models whose decisions, especially in high-stakes and critical settings, we are not able to understand. The thesis tackles the novel problem of Local-to-Global (L2G) explainability, and local explainability. In a L2G setting one wishes to infer an understanding of the overall behavior of a model starting from explanations of its punctual decisions, that is, to infer global explanations from local ones. We propose two Local-to-Global algorithms to tackle this problem, Rule Relevance Score and GLocalX. Then, we focus on local explainability, and provide an algorithm, TriplEx, to explain Transformer-based models on a variety of tasks.

Opening the Black Box: Empowering Machine Learning Models with Explanations

SETZU, MATTIA
2022

Abstract

The thesis tackles two problems in the recently-born field of Explainable AI (XAI), and proposes some algorithms to solve them. XAI has the overarching goal of providing human-understandable explanations of Machine Learning models, which, nowadays, operate as highly complex black-box models whose decisions, especially in high-stakes and critical settings, we are not able to understand. The thesis tackles the novel problem of Local-to-Global (L2G) explainability, and local explainability. In a L2G setting one wishes to infer an understanding of the overall behavior of a model starting from explanations of its punctual decisions, that is, to infer global explanations from local ones. We propose two Local-to-Global algorithms to tackle this problem, Rule Relevance Score and GLocalX. Then, we focus on local explainability, and provide an algorithm, TriplEx, to explain Transformer-based models on a variety of tasks.
22-mag-2022
Italiano
explainability
Monreale, Anna
Pedreschi, Dino
File in questo prodotto:
File Dimensione Formato  
thesis.pdf

accesso aperto

Dimensione 4.09 MB
Formato Adobe PDF
4.09 MB Adobe PDF Visualizza/Apri
thesis_synthesis_and_phd_activity.pdf

accesso aperto

Dimensione 222.73 kB
Formato Adobe PDF
222.73 kB Adobe PDF Visualizza/Apri

I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14242/215812
Il codice NBN di questa tesi è URN:NBN:IT:UNIPI-215812