Guiding the matrix: conditional GANs and meta-learning for image synthesis

Fontanini, Tomaso

Conditioning Generative Adversarial Networks (GANs), in order to guide them, is a fundamental step to better understand the behaviour of GANs and to greatly improve their results and usability. More specifically, this means feeding additional information into the network in order to control the generated samples. The problems addressed in this thesis will cover a broad range of topics like shoe colorization, few shot colorization, multi-domain image-to-image translation, meta-learning, attention transfer and GANs compression. In particular, will be described: firstly, a system able to automatically colorize shoe models starting from a blank or partially colored 3d model; secondly, a new architecture called MetalGAN that combines meta-learning and conditional GANs for few-shot image colorization; thirdly, an extension of MetalGAN that tackles the problem of multi-domain image-to-image translation and outperforms existing methods; finally, a system where attention maps can be used as knowledge to be transferred in a teacher-student paradigm for image-to-image translation improving the student performance. Each of them deals with conditioning an adversarial network in different ways. For each system both qualitative and quantitative experimental results will be presented.

Guiding the matrix: conditional GANs and meta-learning for image synthesis

Fontanini, Tomaso

2021

Abstract

Conditioning Generative Adversarial Networks (GANs), in order to guide them, is a fundamental step to better understand the behaviour of GANs and to greatly improve their results and usability. More specifically, this means feeding additional information into the network in order to control the generated samples. The problems addressed in this thesis will cover a broad range of topics like shoe colorization, few shot colorization, multi-domain image-to-image translation, meta-learning, attention transfer and GANs compression. In particular, will be described: firstly, a system able to automatically colorize shoe models starting from a blank or partially colored 3d model; secondly, a new architecture called MetalGAN that combines meta-learning and conditional GANs for few-shot image colorization; thirdly, an extension of MetalGAN that tackles the problem of multi-domain image-to-image translation and outperforms existing methods; finally, a system where attention maps can be used as knowledge to be transferred in a teacher-student paradigm for image-to-image translation improving the student performance. Each of them deals with conditioning an adversarial network in different ways. For each system both qualitative and quantitative experimental results will be presented.

Scheda breve

Scheda completa

Scheda completa (DC)

	Corso di studio
	
				Dottorato di ricerca in Tecnologie dell'informazione
			
	Data di pubblicazione
	
				mar-2021
			
	Lingua
	
				Italiano
			
	Parola chiave
	
				Deep learning
Conditional generative adversarial networks
Meta-learning
Tranfer learning
			
	Relatore, Supervisor, Advisor o Tutor
	
				Prati, Andrea
			
	Nome Editore
	
				Università degli Studi di Parma
			
	Collezione di appartenenza
	
				Università degli Studi di Parma

File in questo prodotto:

File	Dimensione	Formato
TesiDottorato.pdf accesso solo da BNCF e BNCR Tipologia: Altro materiale allegato Dimensione 57.48 MB Formato Adobe PDF	57.48 MB	Adobe PDF
relazione-finale-dottorato.pdf accesso solo da BNCF e BNCR Tipologia: Altro materiale allegato Dimensione 5.5 kB Formato Adobe PDF	5.5 kB	Adobe PDF

I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14242/150462

Il codice NBN di questa tesi è URN:NBN:IT:UNIPR-150462