Motivated by transportation tasks on construction sites, this contribution deals with an AI-driven approach to human-robot collaborative transportation. An essential part of the considered problem is navigating the robot to the object to be transported, in the presence of other moving items like the human moving to the object. Robot navigation is tackled with reinforcement learning, and the impact of randomness in the other moving items' behaviour on the robot's training performance is investigated. Results show that the move failure rate of the trained robot policy increases, when the behavioural patterns in the human's movements are disturbed by randomness. On the other hand, when both human and robot are connected to the object, the navigation problem is delegated to the human, which guides the compound human-object-robot to the goal location.
Applying grid world based reinforcement learning to real world collaborative transport / Hammerle, Alexander; Heindl, Christoph; Stubl, Gernot; Thapa, Jenish; Lamon, Edoardo; Pichler, Andreas. - In: PROCEDIA COMPUTER SCIENCE. - ISSN 1877-0509. - 232:(2024), pp. 388-396. (Intervento presentato al convegno 5th International Conference on Industry 4.0 and Smart Manufacturing, ISM 2023 tenutosi a University Institute of Lisbon, prt nel 22-24 November 2023) [10.1016/j.procs.2024.01.038].
Applying grid world based reinforcement learning to real world collaborative transport
Lamon, Edoardo;
2024-01-01
Abstract
Motivated by transportation tasks on construction sites, this contribution deals with an AI-driven approach to human-robot collaborative transportation. An essential part of the considered problem is navigating the robot to the object to be transported, in the presence of other moving items like the human moving to the object. Robot navigation is tackled with reinforcement learning, and the impact of randomness in the other moving items' behaviour on the robot's training performance is investigated. Results show that the move failure rate of the trained robot policy increases, when the behavioural patterns in the human's movements are disturbed by randomness. On the other hand, when both human and robot are connected to the object, the navigation problem is delegated to the human, which guides the compound human-object-robot to the goal location.File | Dimensione | Formato | |
---|---|---|---|
ISM2023.pdf
accesso aperto
Tipologia:
Versione editoriale (Publisher’s layout)
Licenza:
Creative commons
Dimensione
3.71 MB
Formato
Adobe PDF
|
3.71 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione