The need for integrating Artificial Intelligence (AI) and symbolic knowledge has been claimed for a long time. In this context, Neuro-Symbolic AI (NeSy) is emerging as a paradigm for the principled integration of sub-symbolic connectionist systems and logic knowledge. However, a remarkable gap burdens on the integration of ML algorithms and symbolic representations: the latter are discrete objects, while ML models are mostly based on gradient-based optimization in continuous spaces. Hence, using continuous optimization to learn and exploit logic requirements is a challenging problem: a possible solution is to embed formulae in a continuous space in a meaningful way, i.e. preserving the semantics. This thesis presents a range of techniques for defining, characterizing and deploying continuous embeddings of logical formulae, with a focus on Signal Temporal Logic (STL). Starting from a kernel function measuring similarity between STL formulae, we propose a constructive algorithm for computing interpretable finite-dimensional explicit embeddings of STL formulae. We demonstrate their predictive power and we also provide explanations for a vast amount of information retained by the embeddings. Then, we propose an algorithm for mining STL requirements from time-series data, based on the interaction between Bayesian Optimization and Information Retrieval techniques, that works by searching from the discriminating STL property directly in a continuous space representing its semantics. We also propose an interpretable-by-design concept based model for time series classification and we develop an autoencoder architecture based on Graph Neural Networks to construct invertible embeddings of formulae.

Neuro-Symbolic Methods for Time Series Data: Continuous Representations and Learning with Signal Temporal Logic

SAVERI, GAIA
2025

Abstract

The need for integrating Artificial Intelligence (AI) and symbolic knowledge has been claimed for a long time. In this context, Neuro-Symbolic AI (NeSy) is emerging as a paradigm for the principled integration of sub-symbolic connectionist systems and logic knowledge. However, a remarkable gap burdens on the integration of ML algorithms and symbolic representations: the latter are discrete objects, while ML models are mostly based on gradient-based optimization in continuous spaces. Hence, using continuous optimization to learn and exploit logic requirements is a challenging problem: a possible solution is to embed formulae in a continuous space in a meaningful way, i.e. preserving the semantics. This thesis presents a range of techniques for defining, characterizing and deploying continuous embeddings of logical formulae, with a focus on Signal Temporal Logic (STL). Starting from a kernel function measuring similarity between STL formulae, we propose a constructive algorithm for computing interpretable finite-dimensional explicit embeddings of STL formulae. We demonstrate their predictive power and we also provide explanations for a vast amount of information retained by the embeddings. Then, we propose an algorithm for mining STL requirements from time-series data, based on the interaction between Bayesian Optimization and Information Retrieval techniques, that works by searching from the discriminating STL property directly in a continuous space representing its semantics. We also propose an interpretable-by-design concept based model for time series classification and we develop an autoencoder architecture based on Graph Neural Networks to construct invertible embeddings of formulae.
17-feb-2025
Italiano
explainable artificial intelligence
neuro-symbolic learning
temporal logic
Bortolussi, Luca
Nenzi, Laura
File in questo prodotto:
File Dimensione Formato  
FinalReport_Saveri_2.pdf

non disponibili

Licenza: Tutti i diritti riservati
Dimensione 268.94 kB
Formato Adobe PDF
268.94 kB Adobe PDF
PhD_Thesis_Saveri_Gaia_.pdf

accesso aperto

Licenza: Tutti i diritti riservati
Dimensione 8.05 MB
Formato Adobe PDF
8.05 MB Adobe PDF Visualizza/Apri

I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14242/215360
Il codice NBN di questa tesi è URN:NBN:IT:UNIPI-215360