CIRCE Accented Italian Speech Stimuli Corpus
CLAUDIA SORIA
2026
Abstract
This dataset contains a corpus of accented speech audio stimuli in Italian, developed within the CIRCE project for use in perceptual experiments on accent perception and discrimination. The corpus includes 25 recordings from male and female speakers, both native speakers of Italian (L1) and speakers who acquired Italian as a second language (L2). Eight accents of Italian as L1 and three accents of Italian as L2 are represented. All speakers read the same standardized text; each recording lasts approximately 23 seconds, and recordings are organized by the speakers’ linguistic background (L1/L2). The dataset is designed to support matched-guise and verbal-guise experiments, as well as research and educational activities addressing accent perception, linguistic diversity, and accent discrimination.


