CNR Institutional Research Information System

A joint synchrony/mean-rate auditory model, recently proposed by Seneff[6], is embedded into a classical DTW-based system for the recognition of Italian digits. Its performances are evalu¬ated in both clean and noisy speech and compared with those of a system based on the Mel¬cepstrum representation. Experimental results show that the Mel representation outperforms the auditory model. Problems encountered by the auditory model in noisy speech are outlined and suggestions for noise compensation techniques both inside and outside the model are given. Simple image processing techniques aiming to clean up the synchrony spectrogram in noisy speech are suggested and some promising preliminary results are presented.

A Comparison between Mel-Scale Cepstrum and Auditory Model Representation for Noisy Speech Recognition

Cosi P;Falavigna D;Mian GA;Omologo M

1990

Abstract

A joint synchrony/mean-rate auditory model, recently proposed by Seneff[6], is embedded into a classical DTW-based system for the recognition of Italian digits. Its performances are evalu¬ated in both clean and noisy speech and compared with those of a system based on the Mel¬cepstrum representation. Experimental results show that the Mel representation outperforms the auditory model. Problems encountered by the auditory model in noisy speech are outlined and suggestions for noise compensation techniques both inside and outside the model are given. Simple image processing techniques aiming to clean up the synchrony spectrogram in noisy speech are suggested and some promising preliminary results are presented.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				1990
			
	Strutture organizzative
	
				Istituto di Scienze e Tecnologie della Cognizione - ISTC
Istituto di Scienze e Tecnologie della Cognizione - ISTC
			
	Lingua/e
	
				Inglese
			
	Supervisori e coordinatori esterni
	
				Luis Torres, Enrique Masgrau, Miguel A. Lagunas
			
	Titolo del Volume
	
				Proceedings EUSIPCO-90 - Signal Processing: Theory and Applications. Fifth European Signal Processing Conference
			
	Titolo del convegno
	
				Proceedings EUSIPCO-90
			
	Da pagina
	
				1199
			
	A pagina
	
				1202
			
	Numero di pagine
	
				2034
			
	Codice ISBN
	
				0444886362
			
	Nome Editore
	
				North Holland Pub. Co.
Elsevier
			
	Città Editore
	
				Amsterdam
Amsterdam
			
	Nazione Editore
	
				PAESI BASSI
PAESI BASSI
			
	Referee
	
				Sì, ma tipo non specificato
			
	Periodo del Convegno
	
				18-21 September, 1990
			
	Luogo del Convegno
	
				Barcellona, Spain
			
	Parole chiave
	
				Mel-Scale Cepstrum
Auditory Model Representation
Noisy Speech Recognition
			
	Altre informazioni
	
				Cosi P., Falavigna D., Mian G.A., Omologo M.
A Comparison between Mel-Scale Cepstrum and Auditory Model Representation for Noisy Speech Recognition
Proceedings EUSIPCO-90 - Signal Processing: Theory and Applications. Fifth European Signal Processing Conference
Barcellona, Spain
18-21 September, 1990
pp. 1199-1202
			
	Numero autori
	
				4
			
	Fulltext
	
				none
			
	Tutti gli autori
	
						Cosi, P; Falavigna, D; Mian, Ga; Omologo, M
					
	Tipologia Login Miur
	
				273
			
	Tipologia
	
				info:eu-repo/semantics/conferenceObject
			
	Tipologia
	
				04 Contributo in convegno::04.01 Contributo in Atti di convegno
			
	Appare nelle tipologie:
	
				04.01 Contributo in Atti di convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/16760

Citazioni

ND

ND

ND

social impact