Fall resilience remains a significant challenge for robots capable of locomotion. In the case of humanoid robots, arms can be utilized to mitigate impact damage, thereby improving resilience to falls. Although previous work has primarily focused on damage reduction strategies for legged robots, the question of minimizing fall damage in humanoid ballbots remains largely unexplored. In this paper, we introduce a novel multi-policy approach, combining clustering on the initial pose and Reinforcement Learning, to reduce damage resulting from a fall in a commercial humanoid ballbot. We conduct simulations to compare our method against three baselines: (1) a curriculum learning policy, (2) a single-policy approach enhanced by clustering information, and (3) a standard single-policy baseline. Experimental results demonstrate that our approach achieves higher success rates and greater damage reduction compared to existing alternatives.
A Multi-policy Approach Based on Clustering for Minimizing the Damage on a Falling Ballbot / Buzzetti, Giulia; Aractingi, Michel; Zappetti, Davide; Iacca, Giovanni. - (2026), pp. 2251-2256. ( 2025 11th International Conference on Control, Decision and Information Technologies (CoDIT) Split, Croatia 15th July-18th July 2025) [10.1109/codit66093.2025.11321766].
A Multi-policy Approach Based on Clustering for Minimizing the Damage on a Falling Ballbot
Buzzetti, Giulia;Iacca, Giovanni
2026-01-01
Abstract
Fall resilience remains a significant challenge for robots capable of locomotion. In the case of humanoid robots, arms can be utilized to mitigate impact damage, thereby improving resilience to falls. Although previous work has primarily focused on damage reduction strategies for legged robots, the question of minimizing fall damage in humanoid ballbots remains largely unexplored. In this paper, we introduce a novel multi-policy approach, combining clustering on the initial pose and Reinforcement Learning, to reduce damage resulting from a fall in a commercial humanoid ballbot. We conduct simulations to compare our method against three baselines: (1) a curriculum learning policy, (2) a single-policy approach enhanced by clustering information, and (3) a standard single-policy baseline. Experimental results demonstrate that our approach achieves higher success rates and greater damage reduction compared to existing alternatives.| File | Dimensione | Formato | |
|---|---|---|---|
|
A_Multi-policy_Approach_Based_on_Clustering_for_Minimizing_the_Damage_on_a_Falling_Ballbot.pdf
Solo gestori archivio
Tipologia:
Versione editoriale (Publisher’s layout)
Licenza:
Tutti i diritti riservati (All rights reserved)
Dimensione
2.68 MB
Formato
Adobe PDF
|
2.68 MB | Adobe PDF | Visualizza/Apri |
|
_CODIT25_0514_FI.pdf
accesso aperto
Tipologia:
Post-print referato (Refereed author’s manuscript)
Licenza:
Creative commons
Dimensione
1.41 MB
Formato
Adobe PDF
|
1.41 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione



