Mobility data from sources like GPS devices, mobile networks, and Internet of Things sensors offers valuable insights into the movement of people and objects, but it often lacks meaningful context. The first thesis contribution is a novel methodology for the semantic enrichment of raw spatio-temporal data, i.e., trajectories, allowing for the integration of various contextual elements, such as points of interest (POIs), transportation modes, and environmental factors, to name a few. The semantic enrichment provides deeper insights into movement patterns and behaviors. However, the complexity and heterogeneity of semantic trajectories present challenges for storage and analysis. To face these challenges, the second thesis contribution is to propose a summarization method that reduces the volume of enriched trajectories while preserving essential semantic information. This approach involves segmenting geographical areas into meaningful regions and discretizing trajectories based on the regions traversed. Dividing areas into semantically relevant regions raises several challenges regarding their identification and representation, also considering new and richer data sources. The third thesis contribution is a dynamic method for identifying and representing urban regions using user-generated data, such as social media posts and geolocated images. The approach leverages Large Language Models (LLMs) to produce human-readable summaries, enhancing the interpretation of the regions and giving a human-centric perspective of the urban environment. The contributions of this thesis lie in offering robust tools for enriching, summarizing, and analyzing mobility data, addressing key challenges related to data volume, heterogeneity, and context-awareness.

Advanced Methods for Semantic Enrichment and Summarization of Mobility Data

PUGLIESE, CHIARA
2025

Abstract

Mobility data from sources like GPS devices, mobile networks, and Internet of Things sensors offers valuable insights into the movement of people and objects, but it often lacks meaningful context. The first thesis contribution is a novel methodology for the semantic enrichment of raw spatio-temporal data, i.e., trajectories, allowing for the integration of various contextual elements, such as points of interest (POIs), transportation modes, and environmental factors, to name a few. The semantic enrichment provides deeper insights into movement patterns and behaviors. However, the complexity and heterogeneity of semantic trajectories present challenges for storage and analysis. To face these challenges, the second thesis contribution is to propose a summarization method that reduces the volume of enriched trajectories while preserving essential semantic information. This approach involves segmenting geographical areas into meaningful regions and discretizing trajectories based on the regions traversed. Dividing areas into semantically relevant regions raises several challenges regarding their identification and representation, also considering new and richer data sources. The third thesis contribution is a dynamic method for identifying and representing urban regions using user-generated data, such as social media posts and geolocated images. The approach leverages Large Language Models (LLMs) to produce human-readable summaries, enhancing the interpretation of the regions and giving a human-centric perspective of the urban environment. The contributions of this thesis lie in offering robust tools for enriching, summarizing, and analyzing mobility data, addressing key challenges related to data volume, heterogeneity, and context-awareness.
15-feb-2025
Italiano
mobility data
semantic enrichment
semantic trajectory
trajectory summarization
Renso, Chiara
Pinelli, Fabio
File in questo prodotto:
File Dimensione Formato  
PhD_Thesis_Pugliese_Chiara.pdf

accesso aperto

Dimensione 16.22 MB
Formato Adobe PDF
16.22 MB Adobe PDF Visualizza/Apri
Scheda_Attivita_PhD.pdf

non disponibili

Dimensione 146.34 kB
Formato Adobe PDF
146.34 kB Adobe PDF

I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14242/216385
Il codice NBN di questa tesi è URN:NBN:IT:UNIPI-216385