CNR Institutional Research Information System

An Italian speaker-independent continuous-speech digit recognizer is described. The CSLU Toolkit was used to develop and implement the system. In the first set of experiments, the SPK-IRST corpus, a collection of digit sentences recorded in a clean environment, was used both for training and testing the system. In the second set, a band-filtered version (between 300 Hz and 3400 Hz) of the SPK-IRST corpus was considered for training, while the telephone PANDA-CSELT corpus was used for testing the system. A hybrid HMM/NN architecture was applied; in this architecture, a three-layer neural network is used as a state emission probability estimator and the conventional forward-backward algorithm is applied for estimating continuous targets for the NN training patterns. The final network, trained to estimate the probability of 116 contextdependent phonetic categories at every 10-msec frame, was not trained on binary target values, but on the probabilities of each phonetic category belonging to each frame. Training and testing will be described in detail and recognition results will be illustrated.

HMM/Neural Network-Based System for Italian Conituous Digit Recognition

Cosi P;Hosom JP

1999

Abstract

An Italian speaker-independent continuous-speech digit recognizer is described. The CSLU Toolkit was used to develop and implement the system. In the first set of experiments, the SPK-IRST corpus, a collection of digit sentences recorded in a clean environment, was used both for training and testing the system. In the second set, a band-filtered version (between 300 Hz and 3400 Hz) of the SPK-IRST corpus was considered for training, while the telephone PANDA-CSELT corpus was used for testing the system. A hybrid HMM/NN architecture was applied; in this architecture, a three-layer neural network is used as a state emission probability estimator and the conventional forward-backward algorithm is applied for estimating continuous targets for the NN training patterns. The final network, trained to estimate the probability of 116 contextdependent phonetic categories at every 10-msec frame, was not trained on binary target values, but on the probabilities of each phonetic category belonging to each frame. Training and testing will be described in detail and recognition results will be illustrated.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				1999
			
	Strutture organizzative
	
				Istituto di Scienze e Tecnologie della Cognizione - ISTC
Istituto di Scienze e Tecnologie della Cognizione - ISTC
			
	Lingua/e
	
				Inglese
			
	Titolo del Volume
	
				Proceedings of ICPhS-99 - XIV International Congress of Phonetic Sciences
			
	Titolo del convegno
	
				ICPhS-99 - XIV International Congress of Phonetic Sciences
			
	Da pagina
	
				1669
			
	A pagina
	
				1672
			
	URL
	
				http://www2.pd.istc.cnr.it/Papers/PieroCosi/cp-ICPhS99.pdf
			
	Nome Editore
	
				American Institute of Physics
			
	Città Editore
	
				Melville [NY]
			
	Nazione Editore
	
				STATI UNITI D'AMERICA
			
	Referee
	
				Sì, ma tipo non specificato
			
	Periodo del Convegno
	
				14-18 August, 1999
			
	Luogo del Convegno
	
				San Francisco, California, USA
			
	Altre informazioni
	
				Cosi P., Hosom J.P.
"HMM/Neural Network-Based System for Italian Conituous Digit Recognition"
Proceedings XIV International Congress of Phonetic Sciences, ICPhS-99
San Francisco, California, USA
14-18 August, 1999
pp. 1669-1672

http://www2.pd.istc.cnr.it/Papers/PieroCosi/cp-ICPhS99.pdf
			
	Numero autori
	
				2
			
	Fulltext
	
				none
			
	Tutti gli autori
	
						Cosi, P; Hosom, Jp
					
	Tipologia Login Miur
	
				273
			
	Tipologia
	
				info:eu-repo/semantics/conferenceObject
			
	Tipologia
	
				04 Contributo in convegno::04.01 Contributo in Atti di convegno
			
	Appare nelle tipologie:
	
				04.01 Contributo in Atti di convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/18614

Citazioni

ND

ND

ND

social impact