Image matching is a crucial step in various photogrammetric and computer vision applications. Traditional handcrafted matching algorithms automatically identify correspondences between image pairs, correspondences that can be triangulated to create a sparse map of the environment, while estimating the pose of the cameras and, in some cases, the internal camera parameters and distorsions. However, these algorithms often suffer from performance degradation under challenging conditions, such as significant changes in illumination, time or viewing angle. To overcome these limitations, several deep-learning (DL) based matching algorithms have been developed. Trained on large datasets, they have demonstrated considerable potential across a range of computer vision tasks. However, their application in photogrammetric and positioning applications remains underexplored. With image datasets captured within a short time frame and with high redundancy and gradual changes in baselines and viewing angles, DL-based approaches typically provide an accuracy that is similar to traditional handcrafted approaches, such SIFT, which is normally considered the reference approach. However, the radiometric invariance of DL-based methods can be leveraged to address situations where SIFT is prone to failure, thus paving the way for new applications in both photogrammetric and positioning domains. The aim of this work is to explore these new DL-based methods and propose new workflows validated under a metrological approach for four application scenarios: 1) multi-modal matching between infrared and RGB images; 2) multi-modal matching between RGB images and LiDAR point clouds; 3) matching off-track satellite images for 3D reconstruction and change detection; 4) precise positioning in GNSS-denied environments. In the PhD work, new workflows and processing methods have been implemented and validated taking into consideration the accuracy and metric aspects of the Geomatic community.
DEEP-LEARNING LOCAL FEATURES FOR PHOTOGRAMMETRIC APPLICATIONS AND VISUAL POSITIONING
Morelli, Luca
2025
Abstract
Image matching is a crucial step in various photogrammetric and computer vision applications. Traditional handcrafted matching algorithms automatically identify correspondences between image pairs, correspondences that can be triangulated to create a sparse map of the environment, while estimating the pose of the cameras and, in some cases, the internal camera parameters and distorsions. However, these algorithms often suffer from performance degradation under challenging conditions, such as significant changes in illumination, time or viewing angle. To overcome these limitations, several deep-learning (DL) based matching algorithms have been developed. Trained on large datasets, they have demonstrated considerable potential across a range of computer vision tasks. However, their application in photogrammetric and positioning applications remains underexplored. With image datasets captured within a short time frame and with high redundancy and gradual changes in baselines and viewing angles, DL-based approaches typically provide an accuracy that is similar to traditional handcrafted approaches, such SIFT, which is normally considered the reference approach. However, the radiometric invariance of DL-based methods can be leveraged to address situations where SIFT is prone to failure, thus paving the way for new applications in both photogrammetric and positioning domains. The aim of this work is to explore these new DL-based methods and propose new workflows validated under a metrological approach for four application scenarios: 1) multi-modal matching between infrared and RGB images; 2) multi-modal matching between RGB images and LiDAR point clouds; 3) matching off-track satellite images for 3D reconstruction and change detection; 4) precise positioning in GNSS-denied environments. In the PhD work, new workflows and processing methods have been implemented and validated taking into consideration the accuracy and metric aspects of the Geomatic community.File | Dimensione | Formato | |
---|---|---|---|
PhD_thesis_Luca_Morelli.pdf
accesso aperto
Dimensione
136.63 MB
Formato
Adobe PDF
|
136.63 MB | Adobe PDF | Visualizza/Apri |
I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/20.500.14242/195502
URN:NBN:IT:UNITN-195502