Autonomously acquiring knowledge and skills interacting with the environment is fundamental for systems operating in real-world scenarios. While the majority of robotics applications consider a limited range of autonomy, research in autonomous open-ended learning is developing systems capable of adapting to contexts that cannot be foreseen at design-time, selecting their own goals and acquiring skills without the need for task-specific motivations or learning signals. In this perspective, the autonomous learning of interdependent tasks is still an open issue, especially in scenarios where dependencies are non-stationary and the system is forced to re-modulate previously acquired knowledge. Here we investigate these issues, presenting a hierarchical architecture allowing both the autonomous learning of multiple goals, and the sequences reflecting the relations between them. Leveraging the capability of the robot to self-monitor its competence, we endowed the system with a mechanism allowing the autonomous balancing of exploration/exploitation strategies to cope with the non-stationarity of the environment. We implemented the proposed architecture in a UR5e robot and tested it in a scenario with different manipulation tasks with changing interdependencies. Results show the effectiveness of our solution, paving the way to its application to more realistic scenarios involving compound objects and unstructured conditions changing over time.

Autonomous Learning of Task Sequences in Robotic Scenarios with Non-Stationary Interdependencies

Baldassarre, Gianluca;Santucci, Vieri Giuliano
2023

Abstract

Autonomously acquiring knowledge and skills interacting with the environment is fundamental for systems operating in real-world scenarios. While the majority of robotics applications consider a limited range of autonomy, research in autonomous open-ended learning is developing systems capable of adapting to contexts that cannot be foreseen at design-time, selecting their own goals and acquiring skills without the need for task-specific motivations or learning signals. In this perspective, the autonomous learning of interdependent tasks is still an open issue, especially in scenarios where dependencies are non-stationary and the system is forced to re-modulate previously acquired knowledge. Here we investigate these issues, presenting a hierarchical architecture allowing both the autonomous learning of multiple goals, and the sequences reflecting the relations between them. Leveraging the capability of the robot to self-monitor its competence, we endowed the system with a mechanism allowing the autonomous balancing of exploration/exploitation strategies to cope with the non-stationarity of the environment. We implemented the proposed architecture in a UR5e robot and tested it in a scenario with different manipulation tasks with changing interdependencies. Results show the effectiveness of our solution, paving the way to its application to more realistic scenarios involving compound objects and unstructured conditions changing over time.
2023
Istituto di Scienze e Tecnologie della Cognizione - ISTC
Autonomous Robotics
Intrinsic Motivations
Reinforcement Learning
Real Robots
File in questo prodotto:
File Dimensione Formato  
ICAR2023_finalversion.pdf

solo utenti autorizzati

Descrizione: A. Romero, G. Baldassarre, R. J. Duro and V. G. Santucci, "Autonomous Learning of Task Sequences in Robotic Scenarios with Non-Stationary Interdependencies," 2023 21st International Conference on Advanced Robotics (ICAR), Abu Dhabi, United Arab Emirates, 2023, pp. 64-70, doi: 10.1109/ICAR58858.2023.10406757.
Tipologia: Documento in Post-print
Licenza: NON PUBBLICO - Accesso privato/ristretto
Dimensione 4.38 MB
Formato Adobe PDF
4.38 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/517161
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact