This PhD thesis focuses on computational modeling of semantic change through large language models. It investigates the modeling of lexical semantic change, where words change in meaning over time, and extends this study to model text-level semantic change, specifically historical resonance. The thesis addresses three research questions: (1) how to model lexical semantic change using large language models, (2) how to extend the modeling from two time periods to multiple time periods, and (3) how to model historical resonance. The thesis reviews state-of-the-art approaches, proposes novel frameworks, and evaluates different methods across multiple languages. Findings suggest that word embeddings of BERT models can be used to accurately detect semantic change. The thesis also underscores the interpretability and effectiveness of lexical replacements and sense definitions generated by Llama and Flan-T5 models in modeling word meaning. By advancing methodologies and tools in Natural Language Processing and Computational Linguistics, this thesis contributes significantly to modeling the dynamic nature of text semantics over time.

MODELING SEMANTIC CHANGE THROUGH LARGE LANGUAGE MODELS

PERITI, FRANCESCO
2024

Abstract

This PhD thesis focuses on computational modeling of semantic change through large language models. It investigates the modeling of lexical semantic change, where words change in meaning over time, and extends this study to model text-level semantic change, specifically historical resonance. The thesis addresses three research questions: (1) how to model lexical semantic change using large language models, (2) how to extend the modeling from two time periods to multiple time periods, and (3) how to model historical resonance. The thesis reviews state-of-the-art approaches, proposes novel frameworks, and evaluates different methods across multiple languages. Findings suggest that word embeddings of BERT models can be used to accurately detect semantic change. The thesis also underscores the interpretability and effectiveness of lexical replacements and sense definitions generated by Llama and Flan-T5 models in modeling word meaning. By advancing methodologies and tools in Natural Language Processing and Computational Linguistics, this thesis contributes significantly to modeling the dynamic nature of text semantics over time.
5-dic-2024
Inglese
MONTANELLI, STEFANO
SASSI, ROBERTO
Università degli Studi di Milano
University of Milan
270
File in questo prodotto:
File Dimensione Formato  
phd_unimi_R13329.pdf

accesso aperto

Dimensione 6.07 MB
Formato Adobe PDF
6.07 MB Adobe PDF Visualizza/Apri

I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14242/184245
Il codice NBN di questa tesi è URN:NBN:IT:UNIMI-184245