DEEP-LEARNING LOCAL FEATURES FOR PHOTOGRAMMETRIC APPLICATIONS AND VISUAL POSITIONING

Morelli, Luca

Image matching is a crucial step in various photogrammetric and computer vision applications. Traditional handcrafted matching algorithms automatically identify correspondences between image pairs, correspondences that can be triangulated to create a sparse map of the environment, while estimating the pose of the cameras and, in some cases, the internal camera parameters and distorsions. However, these algorithms often suffer from performance degradation under challenging conditions, such as significant changes in illumination, time or viewing angle. To overcome these limitations, several deep-learning (DL) based matching algorithms have been developed. Trained on large datasets, they have demonstrated considerable potential across a range of computer vision tasks. However, their application in photogrammetric and positioning applications remains underexplored. With image datasets captured within a short time frame and with high redundancy and gradual changes in baselines and viewing angles, DL-based approaches typically provide an accuracy that is similar to traditional handcrafted approaches, such SIFT, which is normally considered the reference approach. However, the radiometric invariance of DL-based methods can be leveraged to address situations where SIFT is prone to failure, thus paving the way for new applications in both photogrammetric and positioning domains. The aim of this work is to explore these new DL-based methods and propose new workflows validated under a metrological approach for four application scenarios: 1) multi-modal matching between infrared and RGB images; 2) multi-modal matching between RGB images and LiDAR point clouds; 3) matching off-track satellite images for 3D reconstruction and change detection; 4) precise positioning in GNSS-denied environments. In the PhD work, new workflows and processing methods have been implemented and validated taking into consideration the accuracy and metric aspects of the Geomatic community.

DEEP-LEARNING LOCAL FEATURES FOR PHOTOGRAMMETRIC APPLICATIONS AND VISUAL POSITIONING

Morelli, Luca

2025

Abstract

Image matching is a crucial step in various photogrammetric and computer vision applications. Traditional handcrafted matching algorithms automatically identify correspondences between image pairs, correspondences that can be triangulated to create a sparse map of the environment, while estimating the pose of the cameras and, in some cases, the internal camera parameters and distorsions. However, these algorithms often suffer from performance degradation under challenging conditions, such as significant changes in illumination, time or viewing angle. To overcome these limitations, several deep-learning (DL) based matching algorithms have been developed. Trained on large datasets, they have demonstrated considerable potential across a range of computer vision tasks. However, their application in photogrammetric and positioning applications remains underexplored. With image datasets captured within a short time frame and with high redundancy and gradual changes in baselines and viewing angles, DL-based approaches typically provide an accuracy that is similar to traditional handcrafted approaches, such SIFT, which is normally considered the reference approach. However, the radiometric invariance of DL-based methods can be leveraged to address situations where SIFT is prone to failure, thus paving the way for new applications in both photogrammetric and positioning domains. The aim of this work is to explore these new DL-based methods and propose new workflows validated under a metrological approach for four application scenarios: 1) multi-modal matching between infrared and RGB images; 2) multi-modal matching between RGB images and LiDAR point clouds; 3) matching off-track satellite images for 3D reconstruction and change detection; 4) precise positioning in GNSS-denied environments. In the PhD work, new workflows and processing methods have been implemented and validated taking into consideration the accuracy and metric aspects of the Geomatic community.

Scheda breve

Scheda completa

Scheda completa (DC)

	Facoltà/Dipartimento
	
				Ingegneria civile, ambientale e mecc (29/10/12-)
			
	Corso di studio
	
				Civil, Environmental and Mechanical Engineering
			
	Data di pubblicazione
	
				27-feb-2025
			
	Lingua
	
				Inglese
			
	Parola chiave
	
				Image matching, Deep learning, Photogrammetry, Visual Odometry, Simultaneous Localization and Mapping
			
	Relatore, Supervisor, Advisor o Tutor
	
				Remondino, Fabio
Vitti, Alfonso
			
	Nome Editore
	
				Università degli studi di Trento
			
	Città Editore
	
				TRENTO
			
	Numero di pagine
	
				124
			
	Collezione di appartenenza
	
				Università degli Studi di Trento

File in questo prodotto:

File	Dimensione	Formato
PhD_thesis_Luca_Morelli.pdf accesso aperto Licenza: Tutti i diritti riservati Dimensione 136.63 MB Formato Adobe PDF Visualizza/Apri	136.63 MB	Adobe PDF	Visualizza/Apri

I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14242/195502

Il codice NBN di questa tesi è URN:NBN:IT:UNITN-195502