Image matching is a crucial step in various photogrammetric and computer vision applications. Traditional handcrafted matching algorithms automatically identify correspondences between image pairs, correspondences that can be triangulated to create a sparse map of the environment, while estimating the pose of the cameras and, in some cases, the internal camera parameters and distorsions. However, these algorithms often suffer from performance degradation under challenging conditions, such as significant changes in illumination, time or viewing angle. To overcome these limitations, several deep-learning (DL) based matching algorithms have been developed. Trained on large datasets, they have demonstrated considerable potential across a range of computer vision tasks. However, their application in photogrammetric and positioning applications remains underexplored. With image datasets captured within a short time frame and with high redundancy and gradual changes in baselines and viewing angles, DL-based approaches typically provide an accuracy that is similar to traditional handcrafted approaches, such SIFT, which is normally considered the reference approach. However, the radiometric invariance of DL-based methods can be leveraged to address situations where SIFT is prone to failure, thus paving the way for new applications in both photogrammetric and positioning domains. The aim of this work is to explore these new DL-based methods and propose new workflows validated under a metrological approach for four application scenarios: 1) multi-modal matching between infrared and RGB images; 2) multi-modal matching between RGB images and LiDAR point clouds; 3) matching off-track satellite images for 3D reconstruction and change detection; 4) precise positioning in GNSS-denied environments. In the PhD work, new workflows and processing methods have been implemented and validated taking into consideration the accuracy and metric aspects of the Geomatic community.

DEEP-LEARNING LOCAL FEATURES FOR PHOTOGRAMMETRIC APPLICATIONS AND VISUAL POSITIONING

Morelli, Luca
2025

Abstract

Image matching is a crucial step in various photogrammetric and computer vision applications. Traditional handcrafted matching algorithms automatically identify correspondences between image pairs, correspondences that can be triangulated to create a sparse map of the environment, while estimating the pose of the cameras and, in some cases, the internal camera parameters and distorsions. However, these algorithms often suffer from performance degradation under challenging conditions, such as significant changes in illumination, time or viewing angle. To overcome these limitations, several deep-learning (DL) based matching algorithms have been developed. Trained on large datasets, they have demonstrated considerable potential across a range of computer vision tasks. However, their application in photogrammetric and positioning applications remains underexplored. With image datasets captured within a short time frame and with high redundancy and gradual changes in baselines and viewing angles, DL-based approaches typically provide an accuracy that is similar to traditional handcrafted approaches, such SIFT, which is normally considered the reference approach. However, the radiometric invariance of DL-based methods can be leveraged to address situations where SIFT is prone to failure, thus paving the way for new applications in both photogrammetric and positioning domains. The aim of this work is to explore these new DL-based methods and propose new workflows validated under a metrological approach for four application scenarios: 1) multi-modal matching between infrared and RGB images; 2) multi-modal matching between RGB images and LiDAR point clouds; 3) matching off-track satellite images for 3D reconstruction and change detection; 4) precise positioning in GNSS-denied environments. In the PhD work, new workflows and processing methods have been implemented and validated taking into consideration the accuracy and metric aspects of the Geomatic community.
27-feb-2025
Inglese
Image matching, Deep learning, Photogrammetry, Visual Odometry, Simultaneous Localization and Mapping
Vitti, Alfonso
Università degli studi di Trento
TRENTO
124
File in questo prodotto:
File Dimensione Formato  
PhD_thesis_Luca_Morelli.pdf

accesso aperto

Dimensione 136.63 MB
Formato Adobe PDF
136.63 MB Adobe PDF Visualizza/Apri

I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14242/195502
Il codice NBN di questa tesi è URN:NBN:IT:UNITN-195502