Learning with delayed synaptic plasticity

IRIS

The plasticity property of biological neural networks allows them to perform learning and optimize their behavior by changing their configuration. Inspired by biology, plasticity can be modeled in artificial neural networks by using Hebbian learning rules, i.e. rules that update synapses based on the neuron activations and reinforcement signals. However, the distal reward problem arises when the reinforcement signals are not available immediately after each network output to associate the neuron activations that contributed to receiving the reinforcement signal. In this work, we extend Hebbian plasticity rules to allow learning in distal reward cases. We propose the use of neuron activation traces (NATs) to provide additional data storage in each synapse to keep track of the activation of the neurons. Delayed reinforcement signals are provided after each episode relative to the networks' performance during the previous episode. We employ genetic algorithms to evolve delayed synaptic plasticity (DSP) rules and perform synaptic updates based on NATs and delayed reinforcement signals. We compare DSP with an analogous hill climbing algorithm that does not incorporate domain knowledge introduced with the NATs, and show that the synaptic updates performed by the DSP rules demonstrate more effective training performance relative to the HC algorithm.

Learning with delayed synaptic plasticity / Yaman, A.; Iacca, Giovanni; Mocanu, D. C.; Fletcher, G.; Pechenizkiy, M.. - (2019), pp. 152-160. (Intervento presentato al convegno Genetic and Evolutionary Computation Conference (GECCO) tenutosi a Prague nel 13th- 17th July 2019) [10.1145/3321707.3321723].

Learning with delayed synaptic plasticity

Yaman A.;Iacca, Giovanni;Mocanu D. C.;Fletcher G.;Pechenizkiy M.

2019-01-01

Abstract

The plasticity property of biological neural networks allows them to perform learning and optimize their behavior by changing their configuration. Inspired by biology, plasticity can be modeled in artificial neural networks by using Hebbian learning rules, i.e. rules that update synapses based on the neuron activations and reinforcement signals. However, the distal reward problem arises when the reinforcement signals are not available immediately after each network output to associate the neuron activations that contributed to receiving the reinforcement signal. In this work, we extend Hebbian plasticity rules to allow learning in distal reward cases. We propose the use of neuron activation traces (NATs) to provide additional data storage in each synapse to keep track of the activation of the neurons. Delayed reinforcement signals are provided after each episode relative to the networks' performance during the previous episode. We employ genetic algorithms to evolve delayed synaptic plasticity (DSP) rules and perform synaptic updates based on NATs and delayed reinforcement signals. We compare DSP with an analogous hill climbing algorithm that does not incorporate domain knowledge introduced with the NATs, and show that the synaptic updates performed by the DSP rules demonstrate more effective training performance relative to the HC algorithm.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2019
			
	Titolo del volume (Proceedings title)
	
				Genetic and Evolutionary Computation Conference
			
	Luogo di edizione (Place of publication)
	
				New York
			
	Casa editrice (Publisher)
	
				ACM
			
	ISBN
	
				9781450361118
			
	Codice Scopus (Scopus Identifier)
	
				2-s2.0-85072338773
			
	Codice WOS (WOS identifier)
	
				WOS:000523218400021
			
	Tutti gli autori
	
						Yaman, A.; Iacca, Giovanni; Mocanu, D. C.; Fletcher, G.; Pechenizkiy, M.
					
	Citazione
	
				Learning with delayed synaptic plasticity / Yaman, A.; Iacca, Giovanni; Mocanu, D. C.; Fletcher, G.; Pechenizkiy, M.. - (2019), pp. 152-160. (Intervento presentato al  convegno Genetic and Evolutionary Computation Conference (GECCO) tenutosi a Prague nel 13th- 17th July 2019) [10.1145/3321707.3321723].
			
	Appare nelle tipologie:
	
				04.1 Saggio in atti di convegno (Paper in Proceedings)

File in questo prodotto:

File	Dimensione	Formato
Learning_with_Delayed_Synaptic_Plasticity.pdf accesso aperto Tipologia: Post-print referato (Refereed author’s manuscript) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 1.32 MB Formato Adobe PDF Visualizza/Apri	1.32 MB	Adobe PDF	Visualizza/Apri
3321707.3321723.pdf Solo gestori archivio Tipologia: Versione editoriale (Publisher’s layout) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 1.24 MB Formato Adobe PDF Visualizza/Apri	1.24 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/251757

Citazioni

ND

5

3

ND

social impact