Architectures and Algorithms for Learning Without Data Accumulation over Time

Casoni, Michele

The increasing demand for intelligent systems capable of operating in real-world, temporally dynamic environments calls for a fundamental rethinking of conventional machine learning paradigms. Traditional models rely heavily on the accumulation, storage, and repeated access to vast datasets—a process that contrasts sharply with how biological systems learn: incrementally, causally, and without revisiting the past. This thesis addresses the theoretical and practical challenges involved in designing learning systems that operate without data accumulation over time, laying the foundation for a new class of intelligent agents under the umbrella of Collectionless Artificial Intelligence. At its core, this work proposes learning algorithms and architectures designed to function online, in continuous time, and without reliance on stored data, under strict temporal and spatial locality constraints. In this collectionless setting, the agent is prohibited from buffering or revisiting past inputs, reflecting a more biologically plausible and ethically favorable approach to learning, particularly relevant in edge computing scenarios where privacy, energy efficiency, and decentralization are key. A key contribution is the formulation of the perpetual generation problem, which focuses on enabling systems to autonomously produce consistent and meaningful sequences over indefinite time horizons, without external inputs or memory. The thesis provides a detailed analysis of marginal stability in both linear and nonlinear recurrent neural networks, establishing spectral and dynamical criteria that ensure stable internal dynamics suitable for sustained inference and generation. Another major innovation is the introduction of the Hamiltonian Learning framework, which recasts learning as an optimal control problem solved forward in time. Unlike traditional methods such as backpropagation through time, this approach leverages Hamilton equations to derive learning rules that are local in both time and space. This not only enhances biological plausibility but also enables scalable training of deep and recurrent networks in settings where memory and computational resources are limited. The proposed theories and models are supported by comprehensive experimental validation across a range of tasks, including online sequence generation and streaming data classification. These experiments confirm the effectiveness of the methods under continuous-time constraints, where conventional approaches typically fail due to their reliance on data storage. By integrating perspectives from machine learning and control theory, this thesis advances the vision of self-sufficient, robust, and adaptive learning systems. The methods developed here point toward a new generation of intelligent agents—ones that operate efficiently, respect privacy, and more faithfully mirror the continuous, embodied learning seen in natural organisms.

Architectures and Algorithms for Learning Without Data Accumulation over Time

CASONI, MICHELE

2025

Abstract

The increasing demand for intelligent systems capable of operating in real-world, temporally dynamic environments calls for a fundamental rethinking of conventional machine learning paradigms. Traditional models rely heavily on the accumulation, storage, and repeated access to vast datasets—a process that contrasts sharply with how biological systems learn: incrementally, causally, and without revisiting the past. This thesis addresses the theoretical and practical challenges involved in designing learning systems that operate without data accumulation over time, laying the foundation for a new class of intelligent agents under the umbrella of Collectionless Artificial Intelligence. At its core, this work proposes learning algorithms and architectures designed to function online, in continuous time, and without reliance on stored data, under strict temporal and spatial locality constraints. In this collectionless setting, the agent is prohibited from buffering or revisiting past inputs, reflecting a more biologically plausible and ethically favorable approach to learning, particularly relevant in edge computing scenarios where privacy, energy efficiency, and decentralization are key. A key contribution is the formulation of the perpetual generation problem, which focuses on enabling systems to autonomously produce consistent and meaningful sequences over indefinite time horizons, without external inputs or memory. The thesis provides a detailed analysis of marginal stability in both linear and nonlinear recurrent neural networks, establishing spectral and dynamical criteria that ensure stable internal dynamics suitable for sustained inference and generation. Another major innovation is the introduction of the Hamiltonian Learning framework, which recasts learning as an optimal control problem solved forward in time. Unlike traditional methods such as backpropagation through time, this approach leverages Hamilton equations to derive learning rules that are local in both time and space. This not only enhances biological plausibility but also enables scalable training of deep and recurrent networks in settings where memory and computational resources are limited. The proposed theories and models are supported by comprehensive experimental validation across a range of tasks, including online sequence generation and streaming data classification. These experiments confirm the effectiveness of the methods under continuous-time constraints, where conventional approaches typically fail due to their reliance on data storage. By integrating perspectives from machine learning and control theory, this thesis advances the vision of self-sufficient, robust, and adaptive learning systems. The methods developed here point toward a new generation of intelligent agents—ones that operate efficiently, respect privacy, and more faithfully mirror the continuous, embodied learning seen in natural organisms.

Scheda breve

Scheda completa

Scheda completa (DC)

	Data di pubblicazione
	
				29-lug-2025
			
	Lingua
	
				Inglese
			
	Relatore, Supervisor, Advisor o Tutor
	
				GORI, MARCO
			
	Correlatore, Controrelatore, Co-Supervisor,  Co-Tutor o Coordinatori
	
				GORI, MARCO
MELACCI, STEFANO
			
	Nome Editore
	
				Università degli Studi di Siena
			
	Numero di pagine
	
				106
			
	Collezione di appartenenza
	
				Università degli Studi di Siena

File in questo prodotto:

File	Dimensione	Formato
phd_unisi_120613.pdf accesso aperto Dimensione 16.29 MB Formato Adobe PDF Visualizza/Apri	16.29 MB	Adobe PDF	Visualizza/Apri

I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14242/218182

Il codice NBN di questa tesi è URN:NBN:IT:UNISI-218182