Curriculum Reinforcement Learning (CRL) is an approach to facilitate the learning process of agents by structuring tasks in a sequence of increasing complexity. Despite its potential, many existing CRL methods struggle to efficiently guide agents toward desired outcomes, particularly in the absence of domain knowledge. This paper introduces DiCuRL (Diffusion Curriculum Reinforcement Learning), a novel method that leverages conditional diffusion models to generate curriculum goals. To estimate how close an agent is to achieving its goal, our method uniquely incorporates a Q-function and a trainable reward function based on Adversarial Intrinsic Motivation within the diffusion model. Furthermore, it promotes exploration through the inherent noising and denoising mechanism present in the diffusion models and is environment-agnostic. This combination allows for the generation of challenging yet achievable goals, enabling agents to learn effectively without relying on domain knowledge. We d...

Diffusion-based Curriculum Reinforcement Learning / Sayar, Erdi; Iacca, Giovanni; Oguz, Ozgur S.; Knoll, Alois. - 37:(2024). ( 38th Conference on Neural Information Processing Systems, NeurIPS 2024 Vancouver 10th Dic 2024-15th Dec 2024).

Diffusion-based Curriculum Reinforcement Learning

Iacca, Giovanni;
2024-01-01

Abstract

Curriculum Reinforcement Learning (CRL) is an approach to facilitate the learning process of agents by structuring tasks in a sequence of increasing complexity. Despite its potential, many existing CRL methods struggle to efficiently guide agents toward desired outcomes, particularly in the absence of domain knowledge. This paper introduces DiCuRL (Diffusion Curriculum Reinforcement Learning), a novel method that leverages conditional diffusion models to generate curriculum goals. To estimate how close an agent is to achieving its goal, our method uniquely incorporates a Q-function and a trainable reward function based on Adversarial Intrinsic Motivation within the diffusion model. Furthermore, it promotes exploration through the inherent noising and denoising mechanism present in the diffusion models and is environment-agnostic. This combination allows for the generation of challenging yet achievable goals, enabling agents to learn effectively without relying on domain knowledge. We d...
2024
Advances in Neural Information Processing Systems 37 (NeurIPS 2024)
Vancouver
NeurIPS Foundation
9798331314385
Sayar, Erdi; Iacca, Giovanni; Oguz, Ozgur S.; Knoll, Alois
Diffusion-based Curriculum Reinforcement Learning / Sayar, Erdi; Iacca, Giovanni; Oguz, Ozgur S.; Knoll, Alois. - 37:(2024). ( 38th Conference on Neural Information Processing Systems, NeurIPS 2024 Vancouver 10th Dic 2024-15th Dec 2024).
File in questo prodotto:
File Dimensione Formato  
11023_Diffusion_based_Curricul.pdf

accesso aperto

Tipologia: Versione editoriale (Publisher’s layout)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 8.1 MB
Formato Adobe PDF
8.1 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/438653
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? ND
  • OpenAlex 1
social impact