Semantic heterogeneity is conventionally understood as the existence of variance in the representation of the same target reality when computationally modelled by independent parties. It can have serious implications in critical application scenarios like that of Knowledge Graph-based multilingual data integration. In view of the above, the thesis argues that the current understanding of the problem of semantic heterogeneity as the ‘existence of variance’, while being crucially necessary, is not sufficient and undercharacterized. There can be no variance without a prior notion of a unifying reference taken as the basis for computing the variance itself. To that end, the thesis proposes the problem of representation heterogeneity to emphasize the fact that heterogeneity is an intrinsic property of any representation, wherein, different observers encode different representations of the same target reality in a stratified manner using different concepts, language and knowledge (as well as data). The thesis then advances a top-down solution approach to the above stratified problem of representation heterogeneity in terms of several solution components, namely: (i) a representation formalism stratified into concept level, language level, knowledge level and data level to accommodate representation heterogeneity, (ii) a top-down language representation using Universal Knowledge Core (UKC), UKC namespaces and domain languages to tackle the conceptual and language level heterogeneity, (iii) a top-down knowledge representation using the notions of language teleontology and knowledge teleontology to tackle the knowledge level heterogeneity, (iv) the usage and further development of the existing LiveKnowledge catalog for enforcing iterative reuse and sharing of language and knowledge representations, and, (v) the kTelos methodology integrating the solution components above to iteratively generate the language and knowledge representations absolving representation heterogeneity. The thesis also includes proof-of-concepts of the language and knowledge representations developed for two international research projects - DataScientia (data catalogs) and JIDEP (materials modelling). Finally, the thesis concludes with future lines of research.

Language and Knowledge Representation: A Stratified Approach

Bagchi, Mayukh
2025

Abstract

Semantic heterogeneity is conventionally understood as the existence of variance in the representation of the same target reality when computationally modelled by independent parties. It can have serious implications in critical application scenarios like that of Knowledge Graph-based multilingual data integration. In view of the above, the thesis argues that the current understanding of the problem of semantic heterogeneity as the ‘existence of variance’, while being crucially necessary, is not sufficient and undercharacterized. There can be no variance without a prior notion of a unifying reference taken as the basis for computing the variance itself. To that end, the thesis proposes the problem of representation heterogeneity to emphasize the fact that heterogeneity is an intrinsic property of any representation, wherein, different observers encode different representations of the same target reality in a stratified manner using different concepts, language and knowledge (as well as data). The thesis then advances a top-down solution approach to the above stratified problem of representation heterogeneity in terms of several solution components, namely: (i) a representation formalism stratified into concept level, language level, knowledge level and data level to accommodate representation heterogeneity, (ii) a top-down language representation using Universal Knowledge Core (UKC), UKC namespaces and domain languages to tackle the conceptual and language level heterogeneity, (iii) a top-down knowledge representation using the notions of language teleontology and knowledge teleontology to tackle the knowledge level heterogeneity, (iv) the usage and further development of the existing LiveKnowledge catalog for enforcing iterative reuse and sharing of language and knowledge representations, and, (v) the kTelos methodology integrating the solution components above to iteratively generate the language and knowledge representations absolving representation heterogeneity. The thesis also includes proof-of-concepts of the language and knowledge representations developed for two international research projects - DataScientia (data catalogs) and JIDEP (materials modelling). Finally, the thesis concludes with future lines of research.
14-apr-2025
Inglese
Giunchiglia, Fausto
Università degli studi di Trento
TRENTO
133
File in questo prodotto:
File Dimensione Formato  
2025 - MyPhD - Language_and_Knowledge_Representation - A_Stratified_Approach - IRIS.pdf

accesso aperto

Dimensione 11.92 MB
Formato Adobe PDF
11.92 MB Adobe PDF Visualizza/Apri

I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14242/208390
Il codice NBN di questa tesi è URN:NBN:IT:UNITN-208390