Language and Knowledge Representation: A Stratified Approach

Bagchi, Mayukh

Semantic heterogeneity is conventionally understood as the existence of variance in the representation of the same target reality when computationally modelled by independent parties. It can have serious implications in critical application scenarios like that of Knowledge Graph-based multilingual data integration. In view of the above, the thesis argues that the current understanding of the problem of semantic heterogeneity as the ‘existence of variance’, while being crucially necessary, is not sufficient and undercharacterized. There can be no variance without a prior notion of a unifying reference taken as the basis for computing the variance itself. To that end, the thesis proposes the problem of representation heterogeneity to emphasize the fact that heterogeneity is an intrinsic property of any representation, wherein, different observers encode different representations of the same target reality in a stratified manner using different concepts, language and knowledge (as well as data). The thesis then advances a top-down solution approach to the above stratified problem of representation heterogeneity in terms of several solution components, namely: (i) a representation formalism stratified into concept level, language level, knowledge level and data level to accommodate representation heterogeneity, (ii) a top-down language representation using Universal Knowledge Core (UKC), UKC namespaces and domain languages to tackle the conceptual and language level heterogeneity, (iii) a top-down knowledge representation using the notions of language teleontology and knowledge teleontology to tackle the knowledge level heterogeneity, (iv) the usage and further development of the existing LiveKnowledge catalog for enforcing iterative reuse and sharing of language and knowledge representations, and, (v) the kTelos methodology integrating the solution components above to iteratively generate the language and knowledge representations absolving representation heterogeneity. The thesis also includes proof-of-concepts of the language and knowledge representations developed for two international research projects - DataScientia (data catalogs) and JIDEP (materials modelling). Finally, the thesis concludes with future lines of research.

Language and Knowledge Representation: A Stratified Approach

Bagchi, Mayukh

2025

Abstract

Semantic heterogeneity is conventionally understood as the existence of variance in the representation of the same target reality when computationally modelled by independent parties. It can have serious implications in critical application scenarios like that of Knowledge Graph-based multilingual data integration. In view of the above, the thesis argues that the current understanding of the problem of semantic heterogeneity as the ‘existence of variance’, while being crucially necessary, is not sufficient and undercharacterized. There can be no variance without a prior notion of a unifying reference taken as the basis for computing the variance itself. To that end, the thesis proposes the problem of representation heterogeneity to emphasize the fact that heterogeneity is an intrinsic property of any representation, wherein, different observers encode different representations of the same target reality in a stratified manner using different concepts, language and knowledge (as well as data). The thesis then advances a top-down solution approach to the above stratified problem of representation heterogeneity in terms of several solution components, namely: (i) a representation formalism stratified into concept level, language level, knowledge level and data level to accommodate representation heterogeneity, (ii) a top-down language representation using Universal Knowledge Core (UKC), UKC namespaces and domain languages to tackle the conceptual and language level heterogeneity, (iii) a top-down knowledge representation using the notions of language teleontology and knowledge teleontology to tackle the knowledge level heterogeneity, (iv) the usage and further development of the existing LiveKnowledge catalog for enforcing iterative reuse and sharing of language and knowledge representations, and, (v) the kTelos methodology integrating the solution components above to iteratively generate the language and knowledge representations absolving representation heterogeneity. The thesis also includes proof-of-concepts of the language and knowledge representations developed for two international research projects - DataScientia (data catalogs) and JIDEP (materials modelling). Finally, the thesis concludes with future lines of research.

Scheda breve

Scheda completa

Scheda completa (DC)

	Facoltà/Dipartimento
	
				Ingegneria e scienza dell'Informaz (29/10/12-)
			
	Corso di studio
	
				Information and Communication Technology
			
	Data di pubblicazione
	
				14-apr-2025
			
	Lingua
	
				Inglese
			
	Relatore, Supervisor, Advisor o Tutor
	
				Giunchiglia, Fausto
			
	Nome Editore
	
				Università degli studi di Trento
			
	Città Editore
	
				TRENTO
			
	Numero di pagine
	
				133
			
	Collezione di appartenenza
	
				Università degli Studi di Trento

File in questo prodotto:

File	Dimensione	Formato
2025 - MyPhD - Language_and_Knowledge_Representation - A_Stratified_Approach - IRIS.pdf accesso aperto Dimensione 11.92 MB Formato Adobe PDF Visualizza/Apri	11.92 MB	Adobe PDF	Visualizza/Apri

I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14242/208390

Il codice NBN di questa tesi è URN:NBN:IT:UNITN-208390