Antimicrobial resistance (AMR) constitutes a major challenge for contemporary healthcare systems, particularly within hospital environments where selective pressure, patient mobility, and heterogeneous transmission dynamics interact. This thesis addresses AMR through two complementary methodological perspectives: data-driven approaches based on machine learning (ML) and mechanistic population models grounded in mathematical epidemiology. The first part focuses on predictive modeling and data reconstruction in AMR research. Using genomic and phenotypic datasets, including proprietary, public, and synthetic data, the study investigates the performance of multiple ML algorithms for AMR prediction and missing data imputation. Specific preprocessing strategies (correlation analysis, feature handling, and data balancing) are evaluated. Results indicate that resistance phenotypes can be accurately predicted when genomic determinants are adequately represented, confirming that resistance patterns are reflected in identifiable genomic signatures. Model performance remains closely linked to the biological relevance and representativeness of the available features, underscoring the importance of informed feature selection and domain knowledge in achieving reliable predictions. In parallel, imputation analyses show that several ML strategies effectively reconstruct incomplete datasets under different missingness mechanisms, with robustness supported by the structural properties of the data and the informativeness of the variables. The second part shifts to mechanistic approaches, developing compartmental models to analyze AMR transmission and strain competition in hospital and spatially structured settings. Building upon classical epidemic theory (SIS, SIR) and the Next Generation Matrix (NGM) formalism, the thesis introduces the XSR model to describe sensitive–resistant dynamics at single-ward and metapopulation levels. Analytical derivation of equilibria, stability conditions, and reproduction numbers provides explicit thresholds governing resistance persistence. The framework is extended through the formulation of a novel SIIS model incorporating spatial mobility and competitive interactions between strains. Deterministic and stochastic simulations validate the theoretical results and reveal how fitness costs, selective pressure, spatial heterogeneity, and mobility patterns shape invasion and coexistence scenarios. The thesis concludes with a general discussion that connects the results of the two parts, emphasizing how data-driven and theoretical modeling approaches can jointly enhance the understanding and management of AMR. By integrating ML and population dynamics, the work aims to provide methodological and conceptual contributions that may support future research and inform decision-making in clinical and public health contexts.
La resistenza antimicrobica (AMR) costituisce una delle principali sfide per i sistemi sanitari contemporanei, in particolare negli ambienti ospedalieri, dove pressione selettiva, mobilità dei pazienti e dinamiche di trasmissione eterogenee interagiscono tra loro. La presente tesi affronta l’AMR attraverso due prospettive metodologiche complementari: approcci data-driven basati sul machine learning (ML) e modelli di popolazione di tipo meccanicistico fondati sull’epidemiologia matematica. La prima parte si concentra sulla modellazione predittiva e sulla ricostruzione dei dati nell’ambito della ricerca sull’AMR. Utilizzando dataset genomici e fenotipici, inclusi dati proprietari, pubblici e sintetici, lo studio analizza le prestazioni di diversi algoritmi di ML per la previsione della resistenza antimicrobica e per l’imputazione di dati mancanti. Vengono valutate specifiche strategie di preprocessing (analisi di correlazione, gestione delle feature e bilanciamento dei dati). I risultati indicano che i fenotipi di resistenza possono essere previsti con accuratezza quando i determinanti genomici sono adeguatamente rappresentati, confermando che i pattern di resistenza si riflettono in firme genomiche identificabili. Le prestazioni dei modelli risultano strettamente legate alla rilevanza biologica e alla rappresentatività delle variabili disponibili, evidenziando l’importanza di una selezione informata delle feature e dell’integrazione con la conoscenza di dominio per ottenere previsioni affidabili. Parallelamente, le analisi di imputazione mostrano che diverse strategie di ML consentono di ricostruire efficacemente dataset incompleti in presenza di differenti meccanismi di mancanza dei dati, con una robustezza supportata dalle proprietà strutturali dei dati e dall’informatività delle variabili. La seconda parte adotta un approccio meccanicistico, sviluppando modelli compartimentali per analizzare la trasmissione dell’AMR e la competizione tra ceppi in contesti ospedalieri e spazialmente strutturati. Sulla base della teoria epidemiologica classica (modelli SIS, SIR) e del formalismo della Next Generation Matrix (NGM), la tesi introduce il modello XSR per descrivere le dinamiche tra ceppi sensibili e resistenti a livello di singolo reparto e di metapopolazione. La derivazione analitica degli equilibri, delle condizioni di stabilità e dei numeri di riproduzione fornisce soglie esplicite che regolano la persistenza della resistenza. Il quadro teorico viene ulteriormente esteso mediante la formulazione di un nuovo modello SIIS, che incorpora mobilità spaziale e interazioni competitive tra ceppi. Simulazioni deterministiche e stocastiche validano i risultati teorici e mostrano come costi di fitness, pressione selettiva, eterogeneità spaziale e pattern di mobilità influenzino gli scenari di invasione e coesistenza. La tesi si conclude con una discussione generale che collega i risultati delle due parti, evidenziando come approcci data-driven e modellazione teorica possano integrarsi nel migliorare la comprensione e la gestione dell’AMR. Integrando machine learning e dinamica delle popolazioni, il lavoro mira a fornire contributi metodologici e concettuali utili a supportare la ricerca futura e a informare i processi decisionali in ambito clinico e di sanità pubblica.
Engineering methods for modeling antimicrobial resistance [Metodi ingegneristici per la modellazione della resistenza antimicrobica]
CONDORELLI, CHIARA
2026
Abstract
Antimicrobial resistance (AMR) constitutes a major challenge for contemporary healthcare systems, particularly within hospital environments where selective pressure, patient mobility, and heterogeneous transmission dynamics interact. This thesis addresses AMR through two complementary methodological perspectives: data-driven approaches based on machine learning (ML) and mechanistic population models grounded in mathematical epidemiology. The first part focuses on predictive modeling and data reconstruction in AMR research. Using genomic and phenotypic datasets, including proprietary, public, and synthetic data, the study investigates the performance of multiple ML algorithms for AMR prediction and missing data imputation. Specific preprocessing strategies (correlation analysis, feature handling, and data balancing) are evaluated. Results indicate that resistance phenotypes can be accurately predicted when genomic determinants are adequately represented, confirming that resistance patterns are reflected in identifiable genomic signatures. Model performance remains closely linked to the biological relevance and representativeness of the available features, underscoring the importance of informed feature selection and domain knowledge in achieving reliable predictions. In parallel, imputation analyses show that several ML strategies effectively reconstruct incomplete datasets under different missingness mechanisms, with robustness supported by the structural properties of the data and the informativeness of the variables. The second part shifts to mechanistic approaches, developing compartmental models to analyze AMR transmission and strain competition in hospital and spatially structured settings. Building upon classical epidemic theory (SIS, SIR) and the Next Generation Matrix (NGM) formalism, the thesis introduces the XSR model to describe sensitive–resistant dynamics at single-ward and metapopulation levels. Analytical derivation of equilibria, stability conditions, and reproduction numbers provides explicit thresholds governing resistance persistence. The framework is extended through the formulation of a novel SIIS model incorporating spatial mobility and competitive interactions between strains. Deterministic and stochastic simulations validate the theoretical results and reveal how fitness costs, selective pressure, spatial heterogeneity, and mobility patterns shape invasion and coexistence scenarios. The thesis concludes with a general discussion that connects the results of the two parts, emphasizing how data-driven and theoretical modeling approaches can jointly enhance the understanding and management of AMR. By integrating ML and population dynamics, the work aims to provide methodological and conceptual contributions that may support future research and inform decision-making in clinical and public health contexts.| File | Dimensione | Formato | |
|---|---|---|---|
|
Engineering Methods for Modeling Antimicrobial Resistance - Chiara Condorelli.pdf
accesso aperto
Licenza:
Tutti i diritti riservati
Dimensione
6.68 MB
Formato
Adobe PDF
|
6.68 MB | Adobe PDF | Visualizza/Apri |
I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/20.500.14242/374142
URN:NBN:IT:UNICT-374142