Next Generation Computing Systems: Improving Memory and Hardware Accelerators Management

Mannino, Mirco

The efficient management of hardware resources is a crucial challenge for modern computing systems, driven by the exponential growth of data processed by contemporary workloads and the proliferation of different and specialized hardware accelerators, which are enabling the scaling of performance and energy efficiency through specialization and integration. One of the most challenging resources to manage is memory, where address translation, in certain scenarios, can significantly impact performance. While increasing page size might mitigate this issue, it introduces challenges in physical memory allocation. Additionally, managing hardware accelerators in heterogeneous systems presents its difficulties. Current solutions, often relying on device drivers, can incur substantial overhead, particularly for fine-grained tasks. This thesis addresses these challenges with innovative solutions. First, we introduce Decoupled Virtual Address Spaces (DeVAS), a memory management technique that inserts an additional address space between virtual and physical address spaces, supported by an extended memory controller. DeVAS effectively reduces address translation overhead and enables efficient management of hybrid memory systems comprising multiple memory technologies. For hardware accelerators management, we propose the ISA Extension for Integrated Accelerator Management (IXIAM) framework. IXIAM employs an ISA extension with specialized instructions to optimize operations such as accelerator reservation, task offloading, synchronization, and fine-grained control of accelerator-local memory, that significantly improve performance in heterogeneous System-on-Chip architectures. Lastly, recognizing the critical role of convolution in convolutional neural networks, we present enhancements for executing convolutions on multi-core systems and optimizing address translation support in convolutional accelerators operating within a shared virtual address space. These contributions collectively address key challenges in memory and hardware accelerators management, pushing the boundaries of modern computing systems efficiency.

Next Generation Computing Systems: Improving Memory and Hardware Accelerators Management

MANNINO, MIRCO

2025

Abstract

The efficient management of hardware resources is a crucial challenge for modern computing systems, driven by the exponential growth of data processed by contemporary workloads and the proliferation of different and specialized hardware accelerators, which are enabling the scaling of performance and energy efficiency through specialization and integration. One of the most challenging resources to manage is memory, where address translation, in certain scenarios, can significantly impact performance. While increasing page size might mitigate this issue, it introduces challenges in physical memory allocation. Additionally, managing hardware accelerators in heterogeneous systems presents its difficulties. Current solutions, often relying on device drivers, can incur substantial overhead, particularly for fine-grained tasks. This thesis addresses these challenges with innovative solutions. First, we introduce Decoupled Virtual Address Spaces (DeVAS), a memory management technique that inserts an additional address space between virtual and physical address spaces, supported by an extended memory controller. DeVAS effectively reduces address translation overhead and enables efficient management of hybrid memory systems comprising multiple memory technologies. For hardware accelerators management, we propose the ISA Extension for Integrated Accelerator Management (IXIAM) framework. IXIAM employs an ISA extension with specialized instructions to optimize operations such as accelerator reservation, task offloading, synchronization, and fine-grained control of accelerator-local memory, that significantly improve performance in heterogeneous System-on-Chip architectures. Lastly, recognizing the critical role of convolution in convolutional neural networks, we present enhancements for executing convolutions on multi-core systems and optimizing address translation support in convolutional accelerators operating within a shared virtual address space. These contributions collectively address key challenges in memory and hardware accelerators management, pushing the boundaries of modern computing systems efficiency.

Scheda breve

Scheda completa

Scheda completa (DC)

	Facoltà/Dipartimento
	
				Dipartimento di Ingegneria dell'Informazione e Scienze Matematiche
			
	Data di pubblicazione
	
				14-mar-2025
			
	Lingua
	
				Inglese
			
	Parola chiave
	
				virtual memory
deep learning
			
	Relatore, Supervisor, Advisor o Tutor
	
				BARTOLINI, SANDRO
			
	Correlatore, Controrelatore, Co-Supervisor,  Co-Tutor o Coordinatori
	
				PECCERILLO, BIAGIO
			
	Nome Editore
	
				Università degli Studi di Siena
			
	Collezione di appartenenza
	
				Università degli Studi di Siena

File in questo prodotto:

File	Dimensione	Formato
phd_unisi_123072.pdf accesso aperto Licenza: Pubblico dominio Dimensione 21.54 MB Formato Adobe PDF Visualizza/Apri	21.54 MB	Adobe PDF	Visualizza/Apri

I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14242/196369

Il codice NBN di questa tesi è URN:NBN:IT:UNISI-196369