Neural Q-learning control architectures for a wall-following behavior

Cicirelli G; D'Orazio T; Distante A
2003

Abstract

The Q-learning algorithm, owing to its simplicity and well-developed theory, has been widely used in recent years to realize different behaviors for autonomous vehicles. Most applications have relied on the standard tabular formulation with discrete sets of states and actions. In order to handle continuous variables, function approximators such as neural networks are required. In this work we investigate the neural approach to Q-learning on the robot navigation task of wall following. Several issues have been addressed in order to deal with the convergence problem and the need for large training sets. The experience replay paradigm has also been applied to reduce the unlearning problem. Two different neural network architectures, which use different spatial decompositions of the sensory input, have been compared. The aim is to investigate how different architectural choices affect the learning convergence, the optimality of the final controller and the generalization ability.
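
For readers unfamiliar with the combination of techniques named in the abstract, the following is a minimal illustrative sketch of neural Q-learning with experience replay. It is not the authors' implementation: the sensor dimensionality, the number of discrete actions, the network size and all hyperparameters are assumptions made purely for illustration.

```python
# Minimal sketch of neural Q-learning with experience replay.
# All constants below (sensor count, action count, network size, learning rate)
# are illustrative assumptions, not taken from the paper.
import random
from collections import deque

import numpy as np

N_SENSORS = 8   # assumed number of range readings fed to the network
N_ACTIONS = 5   # assumed number of discrete steering commands
GAMMA = 0.9     # discount factor
ALPHA = 0.01    # learning rate
HIDDEN = 16     # hidden units of a one-hidden-layer MLP

rng = np.random.default_rng(0)
W1 = rng.normal(0, 0.1, (HIDDEN, N_SENSORS))
b1 = np.zeros(HIDDEN)
W2 = rng.normal(0, 0.1, (N_ACTIONS, HIDDEN))
b2 = np.zeros(N_ACTIONS)

replay = deque(maxlen=5000)  # experience replay buffer of (s, a, r, s') tuples


def q_values(s):
    """Forward pass of the MLP approximating Q(s, .)."""
    h = np.tanh(W1 @ s + b1)
    return W2 @ h + b2, h


def select_action(s, epsilon=0.1):
    """Epsilon-greedy action selection over the approximated Q-values."""
    if random.random() < epsilon:
        return random.randrange(N_ACTIONS)
    q, _ = q_values(s)
    return int(np.argmax(q))


def train_on_transition(s, a, r, s_next):
    """One semi-gradient step on the squared TD error for a single transition."""
    q, h = q_values(s)
    q_next, _ = q_values(s_next)
    target = r + GAMMA * np.max(q_next)        # TD target (treated as constant)
    td_error = target - q[a]
    # Backpropagate the error only through the output unit of the taken action.
    grad_out = np.zeros(N_ACTIONS)
    grad_out[a] = td_error
    grad_h = (W2.T @ grad_out) * (1 - h ** 2)  # tanh derivative
    W2 += ALPHA * np.outer(grad_out, h)
    b2 += ALPHA * grad_out
    W1 += ALPHA * np.outer(grad_h, s)
    b1 += ALPHA * grad_h


def replay_step(batch_size=32):
    """Re-present stored experiences to counter unlearning of earlier situations."""
    if len(replay) < batch_size:
        return
    for s, a, r, s_next in random.sample(replay, batch_size):
        train_on_transition(s, a, r, s_next)
```

In such a scheme, each transition (s, a, r, s') collected during a wall-following run would be appended to `replay` and periodically re-presented to the network via `replay_step`, so that learning new situations does not erase what was learned in earlier ones.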
Istituto di Studi sui Sistemi Intelligenti per l'Automazione - ISSIA - Sede Bari
Keywords: reinforcement learning; neural Q-learning

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/66657
Citations
  • PMC: not available
  • Scopus: 6
  • ISI: not available