The main aim of the phd was to apply new techniques and mathematical tools to Data Analysis, mainly to the analysis of foundry data. The partner of the project, that is Enginsoft, a company in Padova, provided data acquired and processed by one of its own software: SmartProdActive. After a detailed analysis of this tool and, we began our mathematical study, keeping in mind the task to address: solving a classification problem on highly imbalanced datasets. First we focused our attention on kernels, allowed us to build a connection between the approximation theory and the Machine Learning framework, where kernels are widely used with the so-called kernel trick. In this context we have introduced and described a suitable modification of KNN using topological information, called TDA Local. Then, we have moved to another context: classification of Persistent Dagrams using SVM with Persistence Kernels. Finally, we investigated the use of Persistent Homology in the context of Intrinsic Dimension of Manifold. All topics have been enriched with a lot of numerical tests and the related Python code are available in the GitHub repository.

Applications of Persistent Homology: Data Classification and Intrinsic Dimension of Manifold

BANDIZIOL, CINZIA
2025

Abstract

The main aim of the phd was to apply new techniques and mathematical tools to Data Analysis, mainly to the analysis of foundry data. The partner of the project, that is Enginsoft, a company in Padova, provided data acquired and processed by one of its own software: SmartProdActive. After a detailed analysis of this tool and, we began our mathematical study, keeping in mind the task to address: solving a classification problem on highly imbalanced datasets. First we focused our attention on kernels, allowed us to build a connection between the approximation theory and the Machine Learning framework, where kernels are widely used with the so-called kernel trick. In this context we have introduced and described a suitable modification of KNN using topological information, called TDA Local. Then, we have moved to another context: classification of Persistent Dagrams using SVM with Persistence Kernels. Finally, we investigated the use of Persistent Homology in the context of Intrinsic Dimension of Manifold. All topics have been enriched with a lot of numerical tests and the related Python code are available in the GitHub repository.
31-mar-2025
Inglese
DE MARCHI, STEFANO
Università degli studi di Padova
File in questo prodotto:
File Dimensione Formato  
tesi_definitiva_Cinzia_Bandiziol.pdf

accesso aperto

Dimensione 2.21 MB
Formato Adobe PDF
2.21 MB Adobe PDF Visualizza/Apri

I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14242/218033
Il codice NBN di questa tesi è URN:NBN:IT:UNIPD-218033