Latent World Models For Intrinsically Motivated Exploration

Ermolov, Aleksandr; Sebe, Nicu

In this work we consider partially observable environments with sparse rewards. We present a self-supervised representation learning method for image-based observations, which arranges embeddings respecting temporal distance of observations. This representation is empirically robust to stochasticity and suitable for novelty detection from the error of a predictive forward model. We consider episodic and life-long uncertainties to guide the exploration. We propose to estimate the missing information about the environment with the world model, which operates in the learned latent space. As a motivation of the method, we analyse the exploration problem in a tabular Partially Observable Labyrinth. We demonstrate the method on image-based hard exploration environments from the Atari benchmark and report significant improvement with respect to prior work. The source code of the method and all the experiments is available at https://github.com/htdt/lwm.

Latent World Models For Intrinsically Motivated Exploration / Ermolov, Aleksandr; Sebe, Nicu. - 2020-:(2020). (Intervento presentato al convegno 34th Conference on Neural Information Processing Systems, NeurIPS 2020 tenutosi a online nel 6th-12th December 2020).

Latent World Models For Intrinsically Motivated Exploration

Ermolov, Aleksandr;Sebe, Nicu

2020-01-01

Abstract

In this work we consider partially observable environments with sparse rewards. We present a self-supervised representation learning method for image-based observations, which arranges embeddings respecting temporal distance of observations. This representation is empirically robust to stochasticity and suitable for novelty detection from the error of a predictive forward model. We consider episodic and life-long uncertainties to guide the exploration. We propose to estimate the missing information about the environment with the world model, which operates in the learned latent space. As a motivation of the method, we analyse the exploration problem in a tabular Partially Observable Labyrinth. We demonstrate the method on image-based hard exploration environments from the Atari benchmark and report significant improvement with respect to prior work. The source code of the method and all the experiments is available at https://github.com/htdt/lwm.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2020
			
	Titolo del volume (Proceedings title)
	
				Advances in Neural Information Processing Systems 33
			
	Luogo di edizione (Place of publication)
	
				San Diego
			
	Casa editrice (Publisher)
	
				Neural Information Processing Systems
			
	ISBN
	
				9781713829546
			
	Codice Scopus (Scopus Identifier)
	
				2-s2.0-85108442662
			
	Codice WOS (WOS identifier)
	
				WOS:001207690601039
			
	Tutti gli autori
	
						Ermolov, Aleksandr; Sebe, Nicu
					
	Citazione
	
				Latent World Models For Intrinsically Motivated Exploration / Ermolov, Aleksandr; Sebe, Nicu. - 2020-:(2020). (Intervento presentato al  convegno 34th Conference on Neural Information Processing Systems, NeurIPS 2020 tenutosi a online nel 6th-12th December 2020).
			
	Appare nelle tipologie:
	
				04.1 Saggio in atti di convegno (Paper in Proceedings)

File in questo prodotto:

File	Dimensione	Formato
NeurIPS-2020-latent-world-models-for-intrinsically-motivated-exploration-Paper.pdf accesso aperto Tipologia: Versione editoriale (Publisher’s layout) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 553.96 kB Formato Adobe PDF Visualizza/Apri	553.96 kB	Adobe PDF	Visualizza/Apri