This work represents a joint collaboration between the Center for Spoken Language Research (CSLR) at the University of Colorado and the Institute of Cognitive Sciences and Technologies of the National Research Council located in Padova Italy. This work was conducted with the specific goals of developing improved recognition of children's speech in Italian and the installation and integration of the children's speech recognition models into the Italian Literacy Tutor system. Specifically, children's speech recognition research for Italian was conducted using the ITC-irst Children's Speech Corpus (Giuliani & Gerosa, 2003). Using the University of Colorado SONIC large vocabulary speech recognition system, we demonstrate a phonetic recognition error rate of 13.5% for a system which incorporates Vocal Tract Length Normalization (VTLN), Cepstral variance normalization, Speaker-Adaptive Trained phonetic models, as well as iterative unsupervised Structural MAP Linear Regression (SMAPLR). These new acoustic models have been incorporated within an Italian version of the Colorado Literacy Tutor system.

Italian Children Speech Recognition with Application to Interactive Books and Tutors

Piero Cosi;
2005

Abstract

This work represents a joint collaboration between the Center for Spoken Language Research (CSLR) at the University of Colorado and the Institute of Cognitive Sciences and Technologies of the National Research Council located in Padova Italy. This work was conducted with the specific goals of developing improved recognition of children's speech in Italian and the installation and integration of the children's speech recognition models into the Italian Literacy Tutor system. Specifically, children's speech recognition research for Italian was conducted using the ITC-irst Children's Speech Corpus (Giuliani & Gerosa, 2003). Using the University of Colorado SONIC large vocabulary speech recognition system, we demonstrate a phonetic recognition error rate of 13.5% for a system which incorporates Vocal Tract Length Normalization (VTLN), Cepstral variance normalization, Speaker-Adaptive Trained phonetic models, as well as iterative unsupervised Structural MAP Linear Regression (SMAPLR). These new acoustic models have been incorporated within an Italian version of the Colorado Literacy Tutor system.
2005
Istituto di Scienze e Tecnologie della Cognizione - ISTC
Istituto di Scienze e Tecnologie della Cognizione - ISTC
Inglese
AISV 2004, 1° Convegno Nazionale AISV, Associazione Italiana di Scienze della Voce, "Misura dei parametri - aspetti tecnologici ed implicazioni nei modelli linguistici"
89 (CD Rom 807)
89 (CD Rom 816)
104
88-88974-69-5
http://www.aisv.it/AISV2004/default.htm
EDK Editore
Torriana
ITALIA
Sì, ma tipo non specificato
2-4 dicembre 2004
Padova, Italy
Cosi P., B. Pellom B "Italian Children's Speech Recognition and Interactive Books and Tutors" in Cosi P. (editor) Abstract Book & CD-Rom Proceedings of AISV 2004, 1st Conference of Associazione Italiana di Scienze della Voce Padova, Italy December 2-4, 2004 EDK Editore s.r.l. Padova, 2005 pp. 807-816 (89) http://www.pd.istc.cnr.it/Papers/PieroCosi/cp-AISV2004.pdf
2
none
Cosi, Piero; Pellom, Bryan
273
info:eu-repo/semantics/conferenceObject
04 Contributo in convegno::04.01 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/17928
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact