CNR Institutional Research Information System

This work represents a joint collaboration between the Center for Spoken Language Research (CSLR) at the University of Colorado and the Institute of Cognitive Sciences and Technologies of the National Research Council located in Padova Italy. This work was conducted with the specific goals of developing improved recognition of children's speech in Italian and the installation and integration of the children's speech recognition models into the Italian Literacy Tutor system. Specifically, children's speech recognition research for Italian was conducted using the ITC-irst Children's Speech Corpus (Giuliani & Gerosa, 2003). Using the University of Colorado SONIC large vocabulary speech recognition system, we demonstrate a phonetic recognition error rate of 13.5% for a system which incorporates Vocal Tract Length Normalization (VTLN), Cepstral variance normalization, Speaker-Adaptive Trained phonetic models, as well as iterative unsupervised Structural MAP Linear Regression (SMAPLR). These new acoustic models have been incorporated within an Italian version of the Colorado Literacy Tutor system.

Italian Children Speech Recognition with Application to Interactive Books and Tutors

Piero Cosi;Bryan Pellom

2005

Abstract

This work represents a joint collaboration between the Center for Spoken Language Research (CSLR) at the University of Colorado and the Institute of Cognitive Sciences and Technologies of the National Research Council located in Padova Italy. This work was conducted with the specific goals of developing improved recognition of children's speech in Italian and the installation and integration of the children's speech recognition models into the Italian Literacy Tutor system. Specifically, children's speech recognition research for Italian was conducted using the ITC-irst Children's Speech Corpus (Giuliani & Gerosa, 2003). Using the University of Colorado SONIC large vocabulary speech recognition system, we demonstrate a phonetic recognition error rate of 13.5% for a system which incorporates Vocal Tract Length Normalization (VTLN), Cepstral variance normalization, Speaker-Adaptive Trained phonetic models, as well as iterative unsupervised Structural MAP Linear Regression (SMAPLR). These new acoustic models have been incorporated within an Italian version of the Colorado Literacy Tutor system.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2005
			
	Strutture organizzative
	
				Istituto di Scienze e Tecnologie della Cognizione - ISTC
Istituto di Scienze e Tecnologie della Cognizione - ISTC
			
	Lingua/e
	
				Inglese
			
	Titolo del convegno
	
				AISV 2004, 1° Convegno Nazionale AISV, Associazione Italiana di Scienze della Voce, "Misura dei parametri - aspetti tecnologici ed implicazioni nei modelli linguistici"
			
	Da pagina
	
				89 (CD Rom 807)
			
	A pagina
	
				89 (CD Rom 816)
			
	Numero di pagine
	
				104
			
	Codice ISBN
	
				88-88974-69-5
			
	URL
	
				http://www.aisv.it/AISV2004/default.htm
			
	Nome Editore
	
				EDK Editore
			
	Città Editore
	
				Torriana
			
	Nazione Editore
	
				ITALIA
			
	Referee
	
				Sì, ma tipo non specificato
			
	Periodo del Convegno
	
				2-4 dicembre 2004
			
	Luogo del Convegno
	
				Padova, Italy
			
	Altre informazioni
	
				Cosi P., B. Pellom B
"Italian Children's Speech Recognition and Interactive Books and Tutors"
in Cosi P. (editor)
Abstract Book & CD-Rom Proceedings of AISV 2004, 1st Conference of Associazione Italiana di Scienze della Voce
Padova, Italy
December 2-4, 2004
EDK Editore s.r.l.
Padova, 2005
pp. 807-816 (89)

http://www.pd.istc.cnr.it/Papers/PieroCosi/cp-AISV2004.pdf
			
	Numero autori
	
				2
			
	Fulltext
	
				none
			
	Tutti gli autori
	
						Cosi, Piero; Pellom, Bryan
					
	Tipologia Login Miur
	
				273
			
	Tipologia
	
				info:eu-repo/semantics/conferenceObject
			
	Tipologia
	
				04 Contributo in convegno::04.01 Contributo in Atti di convegno
			
	Appare nelle tipologie:
	
				04.01 Contributo in Atti di convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/17928

Citazioni

ND

ND

ND

social impact