Learning Neural Search Policies for Classical Planning

IRIS

Heuristic forward search is currently the dominant paradigm in classical planning. Forward search algorithms typically rely on a single, relatively simple variation of best-first search and remain fixed throughout the process of solving a planning problem. Existing work combining multiple search techniques usually aims at supporting best-first search with an additional exploratory mechanism, triggered using a handcrafted criterion. A notable exception is very recent work which combines various search techniques using a trainable policy. That approach, however, is confined to a discrete action space comprising several fixed subroutines. In this paper, we introduce a parametrized search algorithm template which combines various search techniques within a single routine. The template's parameter space defines an infinite space of search algorithms, including, among others, BFS, local and random search. We then propose a neural architecture for designating the values of the search parameters given the state of the search. This enables expressing neural search policies that change the values of the parameters as the search progresses. The policies can be learned automatically, with the objective of maximizing the planner's performance on a given distribution of planning problems. We consider a training setting based on a stochastic optimization algorithm known as the cross-entropy method (CEM). Experimental evaluation of our approach shows that it is capable of finding effective distribution-specific search policies, outperforming the relevant baselines.

Learning Neural Search Policies for Classical Planning / Gomoluch, P., Alrajeh, D., Russo, A., Bucchiarone, A.. - ELETTRONICO. - 30:(2020), pp. 522-530. (Thirtieth International Conference on Automated Planning and Scheduling (ICAPS) Nancy, France October 26-30, 2020).

Learning Neural Search Policies for Classical Planning

Gomoluch Pawel;Alrajeh Dalal;Russo Alessandra;Bucchiarone Antonio

2020-01-01

Abstract

Heuristic forward search is currently the dominant paradigm in classical planning. Forward search algorithms typically rely on a single, relatively simple variation of best-first search and remain fixed throughout the process of solving a planning problem. Existing work combining multiple search techniques usually aims at supporting best-first search with an additional exploratory mechanism, triggered using a handcrafted criterion. A notable exception is very recent work which combines various search techniques using a trainable policy. That approach, however, is confined to a discrete action space comprising several fixed subroutines. In this paper, we introduce a parametrized search algorithm template which combines various search techniques within a single routine. The template's parameter space defines an infinite space of search algorithms, including, among others, BFS, local and random search. We then propose a neural architecture for designating the values of the search parameters given the state of the search. This enables expressing neural search policies that change the values of the parameters as the search progresses. The policies can be learned automatically, with the objective of maximizing the planner's performance on a given distribution of planning problems. We consider a training setting based on a stochastic optimization algorithm known as the cross-entropy method (CEM). Experimental evaluation of our approach shows that it is capable of finding effective distribution-specific search policies, outperforming the relevant baselines.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2020
			
	Titolo del volume (Proceedings title)
	
				Proceedings of the Thirtieth International Conference on Automated Planning and Scheduling (ICAPS)
			
	Luogo di edizione (Place of publication)
	
				California
			
	Casa editrice (Publisher)
	
				AAAI PRESS
			
	Codice Scopus (Scopus Identifier)
	
				2-s2.0-85088512109
			
	Tutti gli autori
	
						Gomoluch, Pawel; Alrajeh, Dalal; Russo, Alessandra; Bucchiarone, Antonio
					
	Citazione
	
				Learning Neural Search Policies for Classical Planning / Gomoluch, P., Alrajeh, D., Russo, A., Bucchiarone, A.. - ELETTRONICO. - 30:(2020), pp. 522-530. (Thirtieth International Conference on Automated Planning and Scheduling (ICAPS) Nancy, France October 26-30, 2020).
			
	Appare nelle tipologie:
	
				04.1 Saggio in atti di convegno (Paper in Proceedings)

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/343559

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

2

ND

ND

social impact