The study of sustainable mobility is becoming central to urban mobility research. Telephonic data represent an important tool for understanding the current patterns of urban mobility; in the last two decades, they have been increasing both in terms of available amount and descriptive power as a proxy for human mobility. Despite the intrinsic uncertainty given by these noisy data, statisticians have paid little attention in the field. This thesis intertwines the statistical analysis of urban mobility based on telephonic data with longitudinal compositional data analysis, that is, the analysis of parts that constitute a whole, where the relative proportions of components change dynamically. Capturing the dependency between successive compositional observations requires tailored methods to take into account the constrained nature of the data. In Chapter 1, we provide a survey of the main probabilistic and statistical methodologies for the exploitation of telephonic data in the field of urban mobility analysis. In Chapter 2, we provide a new approach based on compositional data for urban mobility, where people’s trajectories are represented in the simplex by the proportions of types of roads in their surroundings as they move. We propose a state-space model for the analysis of compositional time series and a model-based clustering approach for the unsupervised aggregation of those series, to uncover the main patterns of human mobility. In Chapter 3, we focus on an alternative approach to longitudinal compositional data analysis, by modeling the observations directly on the simplex and the longitudinal aspect through generalized estimating equations. This more general model is useful for applications where the dependence on the covariates is the main interest.

Models for Longitudinal Compositional Data Analysis with Applications to Mobility Studies

PANAROTTO, ANDREA
2025

Abstract

The study of sustainable mobility is becoming central to urban mobility research. Telephonic data represent an important tool for understanding the current patterns of urban mobility; in the last two decades, they have been increasing both in terms of available amount and descriptive power as a proxy for human mobility. Despite the intrinsic uncertainty given by these noisy data, statisticians have paid little attention in the field. This thesis intertwines the statistical analysis of urban mobility based on telephonic data with longitudinal compositional data analysis, that is, the analysis of parts that constitute a whole, where the relative proportions of components change dynamically. Capturing the dependency between successive compositional observations requires tailored methods to take into account the constrained nature of the data. In Chapter 1, we provide a survey of the main probabilistic and statistical methodologies for the exploitation of telephonic data in the field of urban mobility analysis. In Chapter 2, we provide a new approach based on compositional data for urban mobility, where people’s trajectories are represented in the simplex by the proportions of types of roads in their surroundings as they move. We propose a state-space model for the analysis of compositional time series and a model-based clustering approach for the unsupervised aggregation of those series, to uncover the main patterns of human mobility. In Chapter 3, we focus on an alternative approach to longitudinal compositional data analysis, by modeling the observations directly on the simplex and the longitudinal aspect through generalized estimating equations. This more general model is useful for applications where the dependence on the covariates is the main interest.
10-apr-2025
Inglese
CATTELAN, MANUELA
Università degli studi di Padova
File in questo prodotto:
File Dimensione Formato  
tesi_definitiva_Andrea_Panarotto.pdf

embargo fino al 09/04/2028

Dimensione 12.54 MB
Formato Adobe PDF
12.54 MB Adobe PDF

I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14242/207732
Il codice NBN di questa tesi è URN:NBN:IT:UNIPD-207732