Although the spontaneous origins of concepts from interaction is often given for granted, how the process can start without a fully developed sensorimotor representation system has not been sufficiently explored. Here, we offer a new hypothesis for a mechanism supporting concept formation while learning to perceive and act intentionally. We specify an architecture in which multi-modal sensory patterns are mapped in the same lower-dimensional representation space. The motor repertoire is also represented in the same space via topological mapping. We posit that the acquisition of these mappings can be mutually constrained by maximizing the convergence between sensory and motor representations during online interaction. This learning signal reflects an intrinsic motivation of competence acquisition. We propose that topological alignment via competence acquisition eventually results in a sensorimotor representation system. To assess the consistency of this hypothesis, we develop a computational model and test it in an object manipulation task. Results show that such an intrinsically motivated learning process can create a cross-modal categorization system with semantic content, which supports perception and intentional action selection, which has the resources to re-enact its own multi-modal experiences, and, on this basis, to kick-start the formation of concepts grounded in the external environment. This article is part of the theme issue 'Concepts in interaction: social engagement and inner experiences'.
Kick-starting concept formation with intrinsically motivated learning: The grounding by competence acquisition hypothesis
Mannella F.;Tummolini L.
2023
Abstract
Although the spontaneous origins of concepts from interaction is often given for granted, how the process can start without a fully developed sensorimotor representation system has not been sufficiently explored. Here, we offer a new hypothesis for a mechanism supporting concept formation while learning to perceive and act intentionally. We specify an architecture in which multi-modal sensory patterns are mapped in the same lower-dimensional representation space. The motor repertoire is also represented in the same space via topological mapping. We posit that the acquisition of these mappings can be mutually constrained by maximizing the convergence between sensory and motor representations during online interaction. This learning signal reflects an intrinsic motivation of competence acquisition. We propose that topological alignment via competence acquisition eventually results in a sensorimotor representation system. To assess the consistency of this hypothesis, we develop a computational model and test it in an object manipulation task. Results show that such an intrinsically motivated learning process can create a cross-modal categorization system with semantic content, which supports perception and intentional action selection, which has the resources to re-enact its own multi-modal experiences, and, on this basis, to kick-start the formation of concepts grounded in the external environment. This article is part of the theme issue 'Concepts in interaction: social engagement and inner experiences'.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.