Development of a radiomic model using machine learning for breast lesion classification and breast cancer subtyping

Rizzo, Veronica

Background Breast cancer is a heterogeneous disease requiring accurate diagnosis and classification for optimal treatment. Conventional imaging methods, such as mammography and ultrasound, have limitations in differentiating benign from malignant lesions and in classifying molecular subtypes. Magnetic Resonance Imaging (MRI), combined with radiomics and machine learning (ML), offers a promising non-invasive approach to improving diagnostic accuracy and tumor subtyping. Purpose This study aimed to develop and validate radiomic models for distinguishing between benign and malignant breast lesions and classifying malignant lesions into molecular subtypes (Luminal A, Luminal B, HER2-positive, and Triple-Negative). Materials and Methods This research was conducted on a dataset of 347 patients, including both retrospective and prospectively collected cases. Radiomic features were extracted from manually segmented lesions using the TRACE4Research™ platform. Five ML models were developed: Logistic Regression, Random Forest, k-Nearest Neighbors, Support Vector Machine, and Multi-Layer Perceptron. Internal validation was performed on a prospective cohort (n=47), while external validation included an independent dataset (n=50). Model performance was evaluated using ROC-AUC, accuracy, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV). Results The Benign vs. Malignant model demonstrated excellent performance, achieving a ROC-AUC of 88%. The model maintained a high ROC-AUC of 86% in external validation. The Luminal vs. Non-Luminal model performed well in internal validation (ROC-AUC 84%) but showed a decline in specificity from 74% (internal) to 63% (external validation). The Luminal A vs. Luminal B model exhibited limited discriminatory power (ROC-AUC 74%), indicating significant challenges in distinguishing these subtypes radiologically. The HER2+ vs. Triple-Negative model retained acceptable reliability, with ROC-AUC 81%. Conclusion Radiomic models, particularly the Benign vs. Malignant classifier, showed high diagnostic accuracy, reinforcing the potential of radiomics in non-invasive breast cancer detection. This approach could support clinical decision-making by providing additional quantitative information, optimizing patient management, and minimizing unnecessary interventions. However, molecular subtyping models exhibited a progressive decline in performance, emphasizing the complexity of distinguishing tumor subtypes based solely on imaging. Future research should explore larger, multi-center datasets and integrate radiomics with molecular biomarkers to enhance precision oncology in breast cancer management.

Development of a radiomic model using machine learning for breast lesion classification and breast cancer subtyping

RIZZO, VERONICA

2025

Abstract

Background Breast cancer is a heterogeneous disease requiring accurate diagnosis and classification for optimal treatment. Conventional imaging methods, such as mammography and ultrasound, have limitations in differentiating benign from malignant lesions and in classifying molecular subtypes. Magnetic Resonance Imaging (MRI), combined with radiomics and machine learning (ML), offers a promising non-invasive approach to improving diagnostic accuracy and tumor subtyping. Purpose This study aimed to develop and validate radiomic models for distinguishing between benign and malignant breast lesions and classifying malignant lesions into molecular subtypes (Luminal A, Luminal B, HER2-positive, and Triple-Negative). Materials and Methods This research was conducted on a dataset of 347 patients, including both retrospective and prospectively collected cases. Radiomic features were extracted from manually segmented lesions using the TRACE4Research™ platform. Five ML models were developed: Logistic Regression, Random Forest, k-Nearest Neighbors, Support Vector Machine, and Multi-Layer Perceptron. Internal validation was performed on a prospective cohort (n=47), while external validation included an independent dataset (n=50). Model performance was evaluated using ROC-AUC, accuracy, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV). Results The Benign vs. Malignant model demonstrated excellent performance, achieving a ROC-AUC of 88%. The model maintained a high ROC-AUC of 86% in external validation. The Luminal vs. Non-Luminal model performed well in internal validation (ROC-AUC 84%) but showed a decline in specificity from 74% (internal) to 63% (external validation). The Luminal A vs. Luminal B model exhibited limited discriminatory power (ROC-AUC 74%), indicating significant challenges in distinguishing these subtypes radiologically. The HER2+ vs. Triple-Negative model retained acceptable reliability, with ROC-AUC 81%. Conclusion Radiomic models, particularly the Benign vs. Malignant classifier, showed high diagnostic accuracy, reinforcing the potential of radiomics in non-invasive breast cancer detection. This approach could support clinical decision-making by providing additional quantitative information, optimizing patient management, and minimizing unnecessary interventions. However, molecular subtyping models exhibited a progressive decline in performance, emphasizing the complexity of distinguishing tumor subtypes based solely on imaging. Future research should explore larger, multi-center datasets and integrate radiomics with molecular biomarkers to enhance precision oncology in breast cancer management.

Scheda breve

Scheda completa

Scheda completa (DC)

	Facoltà/Dipartimento
	
				DIPARTIMENTO DI MEDICINA TRASLAZIONALE E DI PRECISIONE
DIPARTIMENTO DI SCIENZE RADIOLOGICHE, ONCOLOGICHE E ANATOMO-PATOLOGICHE
			
	Corso di studio
	
				Tecnologie biomediche innovative in medicina clinica
			
	Data di pubblicazione
	
				18-feb-2025
			
	Lingua
	
				Inglese
			
	Relatore, Supervisor, Advisor o Tutor
	
				CANTISANI, VITO
PEDICONI, FEDERICA
			
	Correlatore, Controrelatore, Co-Supervisor,  Co-Tutor o Coordinatori
	
				ARCA, Marcello
			
	Nome Editore
	
				Università degli Studi di Roma "La Sapienza"
			
	Numero di pagine
	
				61
			
	Collezione di appartenenza
	
				Università degli Studi di Roma La Sapienza

File in questo prodotto:

File	Dimensione	Formato
Tesi_dottorato_Rizzo.pdf accesso aperto Licenza: Tutti i diritti riservati Dimensione 738.46 kB Formato Adobe PDF Visualizza/Apri	738.46 kB	Adobe PDF	Visualizza/Apri

I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14242/200570

Il codice NBN di questa tesi è URN:NBN:IT:UNIROMA1-200570