MODELING SEMANTIC CHANGE THROUGH LARGE LANGUAGE MODELS

Periti, Francesco

This PhD thesis focuses on computational modeling of semantic change through large language models. It investigates the modeling of lexical semantic change, where words change in meaning over time, and extends this study to model text-level semantic change, specifically historical resonance. The thesis addresses three research questions: (1) how to model lexical semantic change using large language models, (2) how to extend the modeling from two time periods to multiple time periods, and (3) how to model historical resonance. The thesis reviews state-of-the-art approaches, proposes novel frameworks, and evaluates different methods across multiple languages. Findings suggest that word embeddings of BERT models can be used to accurately detect semantic change. The thesis also underscores the interpretability and effectiveness of lexical replacements and sense definitions generated by Llama and Flan-T5 models in modeling word meaning. By advancing methodologies and tools in Natural Language Processing and Computational Linguistics, this thesis contributes significantly to modeling the dynamic nature of text semantics over time.

MODELING SEMANTIC CHANGE THROUGH LARGE LANGUAGE MODELS

PERITI, FRANCESCO

2024

Abstract

This PhD thesis focuses on computational modeling of semantic change through large language models. It investigates the modeling of lexical semantic change, where words change in meaning over time, and extends this study to model text-level semantic change, specifically historical resonance. The thesis addresses three research questions: (1) how to model lexical semantic change using large language models, (2) how to extend the modeling from two time periods to multiple time periods, and (3) how to model historical resonance. The thesis reviews state-of-the-art approaches, proposes novel frameworks, and evaluates different methods across multiple languages. Findings suggest that word embeddings of BERT models can be used to accurately detect semantic change. The thesis also underscores the interpretability and effectiveness of lexical replacements and sense definitions generated by Llama and Flan-T5 models in modeling word meaning. By advancing methodologies and tools in Natural Language Processing and Computational Linguistics, this thesis contributes significantly to modeling the dynamic nature of text semantics over time.

Scheda breve

Scheda completa

Scheda completa (DC)

	Facoltà/Dipartimento
	
				Dipartimento di Informatica Giovanni Degli Antoni
			
	Corso di studio
	
				INFORMATICA
			
	Data di pubblicazione
	
				5-dic-2024
			
	Lingua
	
				Inglese
			
	Relatore, Supervisor, Advisor o Tutor
	
				MONTANELLI, STEFANO
			
	Correlatore, Controrelatore, Co-Supervisor,  Co-Tutor o Coordinatori
	
				SASSI, ROBERTO
			
	Nome Editore
	
				Università degli Studi di Milano
			
	Città Editore
	
				University of Milan
			
	Numero di pagine
	
				270
			
	Collezione di appartenenza
	
				Università degli Studi di Milano

File in questo prodotto:

File	Dimensione	Formato
phd_unimi_R13329.pdf accesso aperto Licenza: Tutti i diritti riservati Dimensione 6.07 MB Formato Adobe PDF Visualizza/Apri	6.07 MB	Adobe PDF	Visualizza/Apri

I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14242/184245

Il codice NBN di questa tesi è URN:NBN:IT:UNIMI-184245