CNR Institutional Research Information System

Evolving internal reinforcers for an intrinsically motivated reinforcement-learning robot

Schembri M.; Mirolli M.; Baldassarre G.

2007

Abstract

Intrinsically Motivated Reinforcement Learning (IMRL) has been proposed as a framework within which agents exploit "internal reinforcement" to acquire general-purpose building-block behaviors ("skills") that can later be combined to solve several specific tasks. The architectures proposed so far within this framework are limited in that: (1) they use hardwired "salient events" to form and train skills, which limits agents' autonomy; (2) they are applicable only to problems with abstract states and actions, such as grid-world problems. This paper proposes solutions to these problems in the form of a hierarchical reinforcement-learning architecture that: (1) exploits the ideas and techniques of Evolutionary Robotics to allow the system to autonomously discover "salient events"; (2) uses neural networks to allow the system to cope with continuous states and noisy environments. The paper also starts to explore a new way of producing intrinsic motivations on the basis of the learning progress of skills. The viability of the proposed approach is demonstrated in a simulated robotic scenario.
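As an illustration of the last idea in the abstract, intrinsic motivation driven by the learning progress of skills, the following is a minimal sketch and not the architecture described in the paper. It assumes that each skill reports a scalar error signal (for example a prediction or TD error) and that "learning progress" is measured as the drop in mean error between the older and newer halves of a sliding window. The class name `LearningProgress`, the function `pick_skill`, the window size, and the skill names "reach" and "wander" are all hypothetical.

```python
from collections import deque


class LearningProgress:
    """Tracks recent errors of one skill and reports how fast they shrink."""

    def __init__(self, window=10):
        # Hypothetical window size: number of recent error samples kept.
        self.errors = deque(maxlen=window)

    def update(self, error):
        # Store the magnitude of the latest error signal.
        self.errors.append(abs(error))

    def progress(self):
        # Compare the older half of the window with the newer half.
        n = len(self.errors)
        if n < 2:
            return 0.0
        half = n // 2
        samples = list(self.errors)
        older = sum(samples[:half]) / half
        newer = sum(samples[half:]) / (n - half)
        return older - newer  # positive when the error is decreasing


def pick_skill(trackers):
    """Return the name of the skill whose error is currently dropping fastest."""
    return max(trackers, key=lambda name: trackers[name].progress())


# Two hypothetical skills: one still improving, one with a flat error.
trackers = {"reach": LearningProgress(), "wander": LearningProgress()}
for step in range(10):
    trackers["reach"].update(1.0 / (step + 1))  # error shrinks over time
    trackers["wander"].update(0.5)              # error stays constant
chosen = pick_skill(trackers)
```

In this sketch the selector favors "reach", because its error is still falling while the error of "wander" is flat. A measure of this kind rewards skills that are currently being learned rather than skills that are already mastered or unlearnable.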

Year: 2007
Organizational units: Istituto di Scienze e Tecnologie della Cognizione - ISTC
Language(s): English
External supervisors and coordinators: Demiris Y.; Scassellati B.; Mareschal D.
Volume title: IEEE 6th International Conference on Development and Learning (ICDL2007)
First page: 282
Last page: 287
Number of pages: 6
ISBN: 978-1-4244-1116-0
DOI: https://dx.doi.org/10.1109/DEVLRN.2007.4354052
URL: http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=4354052
Publisher: Imperial College
Publisher city: London
Publisher country: United Kingdom
Peer review: Yes, type not specified
Scopus ID: 2-s2.0-50849094213
Web of Science ID: WOS:000253369100031
Number of authors: 3
Type: 02 Contribution in volume::02.01 Contribution in volume (chapter or essay)
MIUR login type: 268
Full text: none
All authors: Schembri M.; Mirolli M.; Baldassarre G.
Type (OpenAIRE): info:eu-repo/semantics/bookPart
Appears in types: 02.01 Contribution in volume (chapter or essay)

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/129545
