This thesis is about three-independent chapters. It deals with model-based clustering in different framework and with different approaches. Also employed data are quite different, especially between the first chapter and the others. Basically, the clustering procedures are applied on economic time series and they can represent a fondation for further research topics. In the first chapter we apply a clustering procedure to detect trend changes in macroeconomic data, focusing on the GDP time series for the G-7 countries. Two popular trend-cycle decompositions (i.e., Beveridge and Nelson Decom position and Hodrick and Prescott filter) are considered in a preliminary step of the analysis, and we stress the differences between the two methods in terms of the inferred clustering. A finite mixture of regression models is considered to show different patterns and changes in GDP slopes over time. This approach can be used also to detect structural breaks or change points, and it is an alter native to existing approaches in a probabilistic framework and we also discuss international changes in GDP distribution for the G-7 countries, highlighting similarities, e.g., in break dates, aiming at adding more insights on the economic integration among countries. Our findings are that our model is able to represent economic paths of every countries and by looking at the changes in slope of the long-trend component of the GDP, we are able to investigate change points, also compared with alternative approaches. In the second chapter we provide an empirical analysis on the main univariate and multivariate stylized facts in the two of the largest cryptocurrencies, namely Ethereum and Bitcoin, return series. A Markov-Switching Vector Auto Regres sion model is considered to further explore the dynamic relationships between cryptocurrencies and other financial assets, such as gold, S&P and oil. We 5 estimate the presence of volatility clustering, a rapid decay of the autocorrelation function, an excess of kurtosis and multivariate little cross-correlation across the series, except for contemporaneous returns. The model well represent tha univariate and multivariate stylized facts, giving an insight on the considered crypto-currencies as pure financial asset; moreover, we find a relationship between the response variable and the autoregression part and (some of) the exogeneous variables considered. Finally, in the third chapter we introduce multivariate models for analyzing several stock returns series of italian football teams such as AS Roma, FC Juventus, SS Lazio, in order to describe the relationship across these series and to model the evolution over time of the stock returns in a very particular framework; in fact, stock returns of a football team can be influenced both by football performances (national and international) and by non-football events too, like a change in management or a purchase of a superstar footballer. A natural way to model the dependence over time is by using the hidden Markov models and his generalization, hidden semi-Markov models by relaxing the assumption on the so-called sojourn distribution of the hidden states. Instead for the conditional distributions of the observed data (i.e., the emission distribution) we use the multivariate leptokurtic-normal distribution, a generalization of the multivariate normal, with an additional parameter β which describes the excess of kurtosis. Furthermore, some multivariate stylized facts are also investigated. Parameters estimation is performed by Expectation-Maximization (EM) algorithm type which maximizes the log-likelihood function, allowing us to deal with a classification problem as a missing data problem. R has been employed as software; packages such as flexmix (Gr¨un et al., 2007; Gr¨un and Leisch, 2008; Leisch, 2004b), nhmsar (Ailliot et al., 2015) and mhsmm (O’Connell et al., 2011) along side with some custom functions have been used for computation procedures. As we already said, this thesis is about three independent chapters; in order to avoid misunderstanding, every chapter has its own notation.

Essays on model-based clustering for macroeconomic and financial data

CREMASCHINI, ALESSANDRO
2022

Abstract

This thesis is about three-independent chapters. It deals with model-based clustering in different framework and with different approaches. Also employed data are quite different, especially between the first chapter and the others. Basically, the clustering procedures are applied on economic time series and they can represent a fondation for further research topics. In the first chapter we apply a clustering procedure to detect trend changes in macroeconomic data, focusing on the GDP time series for the G-7 countries. Two popular trend-cycle decompositions (i.e., Beveridge and Nelson Decom position and Hodrick and Prescott filter) are considered in a preliminary step of the analysis, and we stress the differences between the two methods in terms of the inferred clustering. A finite mixture of regression models is considered to show different patterns and changes in GDP slopes over time. This approach can be used also to detect structural breaks or change points, and it is an alter native to existing approaches in a probabilistic framework and we also discuss international changes in GDP distribution for the G-7 countries, highlighting similarities, e.g., in break dates, aiming at adding more insights on the economic integration among countries. Our findings are that our model is able to represent economic paths of every countries and by looking at the changes in slope of the long-trend component of the GDP, we are able to investigate change points, also compared with alternative approaches. In the second chapter we provide an empirical analysis on the main univariate and multivariate stylized facts in the two of the largest cryptocurrencies, namely Ethereum and Bitcoin, return series. A Markov-Switching Vector Auto Regres sion model is considered to further explore the dynamic relationships between cryptocurrencies and other financial assets, such as gold, S&P and oil. We 5 estimate the presence of volatility clustering, a rapid decay of the autocorrelation function, an excess of kurtosis and multivariate little cross-correlation across the series, except for contemporaneous returns. The model well represent tha univariate and multivariate stylized facts, giving an insight on the considered crypto-currencies as pure financial asset; moreover, we find a relationship between the response variable and the autoregression part and (some of) the exogeneous variables considered. Finally, in the third chapter we introduce multivariate models for analyzing several stock returns series of italian football teams such as AS Roma, FC Juventus, SS Lazio, in order to describe the relationship across these series and to model the evolution over time of the stock returns in a very particular framework; in fact, stock returns of a football team can be influenced both by football performances (national and international) and by non-football events too, like a change in management or a purchase of a superstar footballer. A natural way to model the dependence over time is by using the hidden Markov models and his generalization, hidden semi-Markov models by relaxing the assumption on the so-called sojourn distribution of the hidden states. Instead for the conditional distributions of the observed data (i.e., the emission distribution) we use the multivariate leptokurtic-normal distribution, a generalization of the multivariate normal, with an additional parameter β which describes the excess of kurtosis. Furthermore, some multivariate stylized facts are also investigated. Parameters estimation is performed by Expectation-Maximization (EM) algorithm type which maximizes the log-likelihood function, allowing us to deal with a classification problem as a missing data problem. R has been employed as software; packages such as flexmix (Gr¨un et al., 2007; Gr¨un and Leisch, 2008; Leisch, 2004b), nhmsar (Ailliot et al., 2015) and mhsmm (O’Connell et al., 2011) along side with some custom functions have been used for computation procedures. As we already said, this thesis is about three independent chapters; in order to avoid misunderstanding, every chapter has its own notation.
27-set-2022
Inglese
Latent growth model; hidden markov model; hidden semi-markov model; markov switching autoregerssive model; time series; structural breaks; mixture model; leptocurtic distribution
LISEO, Brunero
LISEO, Brunero
Università degli Studi di Roma "La Sapienza"
File in questo prodotto:
File Dimensione Formato  
Tesi_dottorato_Cremaschini.pdf

accesso aperto

Dimensione 3.31 MB
Formato Adobe PDF
3.31 MB Adobe PDF Visualizza/Apri

I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14242/100184
Il codice NBN di questa tesi è URN:NBN:IT:UNIROMA1-100184