We consider a complex control problem: making a monopod accurately reach a target with a single jump. The monopod can jump in any direction and onto terrain at different elevations. This is a paradigm for a much larger class of problems, which are extremely challenging and computationally expensive to solve using standard optimisation-based techniques. Reinforcement Learning (RL) is an interesting alternative, but an end-to-end approach, in which the controller must learn everything from scratch, can be difficult for a sparse-reward task like jumping. Our solution is to guide the learning process within an RL framework by leveraging nature-inspired heuristic knowledge. This approach brings several benefits, such as a drastic reduction of learning time and the ability to learn and compensate for possible errors in the low-level execution of the motion. Our simulation results reveal a clear advantage of our solution over both optimisation-based and end-to-end RL approaches.

Efficient Reinforcement Learning for 3D Jumping Monopods / Bussola, Riccardo; Focchi, Michele; Del Prete, Andrea; Fontanelli, Daniele; Palopoli, Luigi. - In: SENSORS. - ISSN 1424-8220. - 24:15(2024). [10.3390/s24154981]

Efficient Reinforcement Learning for 3D Jumping Monopods

Focchi, Michele (co-first author);
Del Prete, Andrea (co-last author);
Fontanelli, Daniele (co-last author);
Palopoli, Luigi (co-last author)
2024-01-01

Files in this item:

bussola23jumpleg.pdf
Access: archive managers only
Type: Non-refereed preprint
License: All rights reserved
Size: 651.87 kB
Format: Adobe PDF

sensors-24-04981.pdf
Access: open access
Type: Publisher’s layout
License: Creative Commons
Size: 609.27 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11572/433790
Citations
  • PubMed Central: not available
  • Scopus: 4
  • Web of Science: 2
  • OpenAlex: not available