Knowledge transfer and retention in deep neural networks

Fini, Enrico

doi:10.15168/11572_374590

This thesis addresses the crucial problem of knowledge transfer and retention in deep neural networks. The ability to transfer knowledge from previously learned tasks and retain it for future use is essential for machine learning models to continually adapt to new tasks and improve their overall performance. In principle, knowledge can be transferred between any type of task, but we believe it to be particularly challenging in the field of computer vision, where the size and diversity of visual data often result in high compute requirements and the need for large, complex models. Hence, we analyze transfer and retention learning between unsupervised and supervised visual tasks, which form the main focus of this thesis. We categorize our efforts into several knowledge transfer and retention paradigms, and we tackle them with several contributions for the scientific community. The thesis proposes settings and methods based on knowledge distillation and self-supervised learning techniques. In particular, we devise two novel continual learning settings and seven new methods for knowledge transfer and retention, setting new state-of-the-art in a wide range of tasks. In conclusion, this thesis provides a valuable contribution to the field of computer vision and machine learning and sets a foundation for future work in this area.

Knowledge transfer and retention in deep neural networks / Fini, Enrico. - (2023 Apr 17), pp. 1-227. [10.15168/11572_374590]

Knowledge transfer and retention in deep neural networks

Fini, Enrico

2023-04-17

Abstract

This thesis addresses the crucial problem of knowledge transfer and retention in deep neural networks. The ability to transfer knowledge from previously learned tasks and retain it for future use is essential for machine learning models to continually adapt to new tasks and improve their overall performance. In principle, knowledge can be transferred between any type of task, but we believe it to be particularly challenging in the field of computer vision, where the size and diversity of visual data often result in high compute requirements and the need for large, complex models. Hence, we analyze transfer and retention learning between unsupervised and supervised visual tasks, which form the main focus of this thesis. We categorize our efforts into several knowledge transfer and retention paradigms, and we tackle them with several contributions for the scientific community. The thesis proposes settings and methods based on knowledge distillation and self-supervised learning techniques. In particular, we devise two novel continual learning settings and seven new methods for knowledge transfer and retention, setting new state-of-the-art in a wide range of tasks. In conclusion, this thesis provides a valuable contribution to the field of computer vision and machine learning and sets a foundation for future work in this area.

Scheda breve

Scheda completa

Scheda completa (DC)

	Data di esame finale/Defended on
	
				17-apr-2023
			
	Ciclo
	
				XXXV
			
	Anno Accademico
	
				2022-2023
			
	Dipartimento
	
				Ingegneria e scienza dell'Informaz (29/10/12-)
			
	Corso di dottorato
	
				Industrial Innovation
			
	Supervisore/Relatore di tesi Unitn (Unitn internal supervisor)
	
				Ricci, Elisa
			
	Supervisore/Relatore di tesi esterno (External supervisor)
	
				Moin Nabi
			
	Tesi in cotutela (Bi-nationally supervised Doctoral Thesis)
	
				no
			
	Paese dell'Istituzione/ente esterno in caso di cotutela o collaborazioni internazionali (Country of the Institution in case of bi-nationally supervised PhD thesis or other international collaborations).
	
				GERMANIA
			
	Codice DOI
	
				https://dx.doi.org/10.15168/11572_374590
			
	Lingua (Language)
	
				Inglese
			
	Appare nelle tipologie:
	
				08.1 Tesi di dottorato (Doctoral Thesis)

File in questo prodotto:

File	Dimensione	Formato
phd_unitn_Fini_Enrico.pdf accesso aperto Descrizione: Main Document Tipologia: Tesi di dottorato (Doctoral Thesis) Licenza: Creative commons Dimensione 20.02 MB Formato Adobe PDF Visualizza/Apri	20.02 MB	Adobe PDF	Visualizza/Apri