CNR Institutional Research Information System

We demonstrate a sample-efficient method for constructing reusable parameterized skills that can solve families of related motor tasks. Our method uses learned policies to analyze the policy space topology and learn a set of regression models which, given a novel task, appropriately parameterizes an underlying low-level controller. By identifying the disjoint charts that compose the policy manifold, the method can separately model the qualitatively different sub-skills required for solving distinct classes of tasks. Such sub-skills are useful because they can be treated as new discrete, specialized actions by higher-level planning processes. We also propose a method for reusing seemingly unsuccessful policies as additional, valid training samples for synthesizing the skill, thus accelerating learning. We evaluate our method on a humanoid iCub robot tasked with learning to accurately throw plastic balls at parameterized target locations.

Learning Parameterized Motor Skills on a Humanoid Robot

Castro da Silva Bruno Castro;Baldassarre Gianluca;Konidaris George;Barto Andrew

2014

Abstract

We demonstrate a sample-efficient method for constructing reusable parameterized skills that can solve families of related motor tasks. Our method uses learned policies to analyze the policy space topology and learn a set of regression models which, given a novel task, appropriately parameterizes an underlying low-level controller. By identifying the disjoint charts that compose the policy manifold, the method can separately model the qualitatively different sub-skills required for solving distinct classes of tasks. Such sub-skills are useful because they can be treated as new discrete, specialized actions by higher-level planning processes. We also propose a method for reusing seemingly unsuccessful policies as additional, valid training samples for synthesizing the skill, thus accelerating learning. We evaluate our method on a humanoid iCub robot tasked with learning to accurately throw plastic balls at parameterized target locations.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2014
			
	Strutture organizzative
	
				Istituto di Scienze e Tecnologie della Cognizione - ISTC
			
	Lingua/e
	
				Inglese
			
	Serie
	
				PROCEEDINGS - IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION
			
	Titolo del convegno
	
				IEEE International Conference on Robotics and Automation (ICRA2014)
			
	Da pagina
	
				5239
			
	A pagina
	
				5244
			
	Numero di pagine
	
				6
			
	Referee
	
				Sì, ma tipo non specificato
			
	Periodo del Convegno
	
				31 May - 7 June 2014
			
	Luogo del Convegno
	
				Hong Kong, China
			
	Parole chiave
	
				Robotics
Artificial Intelligence
Neural networks
Autonomous learning
			
	Altre informazioni
	
				Video of the robot: http://www.youtube.com/watch?v=BLt3GmjDN1o
			
	Codice Web of Science
	
				WOS:000377221105043
			
	Numero autori
	
				4
			
	Fulltext
	
				none
			
	Tutti gli autori
	
						Castro da Silva Bruno, Castro; Baldassarre, Gianluca; Konidaris, George; Barto, Andrew
					
	Tipologia Login Miur
	
				273
			
	Tipologia
	
				info:eu-repo/semantics/conferenceObject
			
	Tipologia
	
				04 Contributo in convegno::04.01 Contributo in Atti di convegno
			
	Appare nelle tipologie:
	
				04.01 Contributo in Atti di convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/380049

Citazioni

ND

ND

24

social impact