In the last few years, major milestones have been achieved in the field of artificial intelligence and neural networks. Part of these leaps forward can be explained by the ever-increasing amount of available training data and by the technological advances of modern computers, but these innovations alone cannot justify such unprecedented breakthroughs. In this thesis it will be discussed the idea that a driving factor of these improvements can be identified in innovations related to information representation and propagation. In particular, the majority of these achievements are linked to the development of new architectures rather than to improvements to existing ones, and these architectural innovations are almost always associated with novel information representation and propagation paradigms. In the following chapters, the concepts and motivations underlying these innovations will be discussed, and following those steps, some innovative architectures which use novel information representation and propagation schemes will be presented. In particular, a new form of bio-inspired information representation based on computational stigmergy will be presented and, after its mathematical formalization, two novel architectures based on it called Stigmergic Neural Network and Stigmergic Memory for Recurrent Neural Networks will be derived. Information propagation in sparse neural networks will also be discussed and the novel Mesh Neural Network and Competitive Joint Unstructured Neural Network architectures will be presented, formalized and discussed. It will be shown how some issues in the design of the backward signals of one of the most used reinforcement learning algorithm are the cause of major information loss, and a solution, with its mathematical proof, will be presented and discussed. Finally, the propagation of multimodal information in complex systems will be discussed and an innovative architecture called CLIP-Guided Generative Latent Space Search, capable of generating images from texts and vice versa via the orchestration and optimization of the information through generative and multimodal networks using an algorithm genetic will be presented.

Efficient Information Representation and Propagation in Artificial Neural Networks

GALATOLO, FEDERICO ANDREA
2022

Abstract

In the last few years, major milestones have been achieved in the field of artificial intelligence and neural networks. Part of these leaps forward can be explained by the ever-increasing amount of available training data and by the technological advances of modern computers, but these innovations alone cannot justify such unprecedented breakthroughs. In this thesis it will be discussed the idea that a driving factor of these improvements can be identified in innovations related to information representation and propagation. In particular, the majority of these achievements are linked to the development of new architectures rather than to improvements to existing ones, and these architectural innovations are almost always associated with novel information representation and propagation paradigms. In the following chapters, the concepts and motivations underlying these innovations will be discussed, and following those steps, some innovative architectures which use novel information representation and propagation schemes will be presented. In particular, a new form of bio-inspired information representation based on computational stigmergy will be presented and, after its mathematical formalization, two novel architectures based on it called Stigmergic Neural Network and Stigmergic Memory for Recurrent Neural Networks will be derived. Information propagation in sparse neural networks will also be discussed and the novel Mesh Neural Network and Competitive Joint Unstructured Neural Network architectures will be presented, formalized and discussed. It will be shown how some issues in the design of the backward signals of one of the most used reinforcement learning algorithm are the cause of major information loss, and a solution, with its mathematical proof, will be presented and discussed. Finally, the propagation of multimodal information in complex systems will be discussed and an innovative architecture called CLIP-Guided Generative Latent Space Search, capable of generating images from texts and vice versa via the orchestration and optimization of the information through generative and multimodal networks using an algorithm genetic will be presented.
11-giu-2022
Italiano
Artificial Intelligence
Artificial Neural Networks
Deep Learning
Machine Learning
Cimino, Mario Giovanni Cosimo Antonio
Vaglini, Gigliola
File in questo prodotto:
File Dimensione Formato  
report_fine_corso.pdf

non disponibili

Dimensione 236.82 kB
Formato Adobe PDF
236.82 kB Adobe PDF
thesis.pdf

embargo fino al 02/05/2062

Dimensione 5.84 MB
Formato Adobe PDF
5.84 MB Adobe PDF

I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14242/215977
Il codice NBN di questa tesi è URN:NBN:IT:UNIPI-215977